Title Of Invention

Cloned Genome Of Infectious 
Hepatitis C Virus of Genotype 2a And Uses Thereof 

Field Of Invention 

The present invention relates to molecular 
approaches to the production of nucleic acid sequence 
which comprises the genome of infectious hepatitis C 
virus. In particular, the invention provides a nucleic 
acid sequence which comprises the genome of an 
infectious hepatitis C virus of genotype 2a. The
invention therefore relates to the use of the nucleic 
acid sequence and polypeptides encoded by all or part of 
the sequence in the development of vaccines and 
diagnostic assays for HCV and in the development of 
screening assays for the identification of antiviral 
agents for HCV. 

Background Of Invention

Hepatitis C virus (HCV) has a positive-sense 
single-strand RNA genome and is a member of the genus 
Hepacivirus within the Flaviviridae family of viruses
(Rice, 1996). As for all positive-stranded RNA viruses,
the genome of HCV functions as mRNA from which all viral 
proteins necessary for propagation are translated. 

The viral genome of HCV is approximately 9600 
nucleotides (nts) in length and consists of a highly
conserved 5' untranslated region (UTR), a single long
open reading frame (ORF) of approximately 9,000 nts and
a complex 3' UTR. The 5' UTR contains an internal
ribosomal entry site (Tsukiyama-Kohara et al., 1992;
Honda et al., 1996). The 3' UTR consists of a short
variable region, a polypyrimidine tract of variable 
length and, at the 3' end, a highly conserved region of 
approximately 100 nucleotides (Kolykhalov et al., 1996; 
Tanaka et al., 1995; Tanaka et al., 1996; Yamada et al., 
1996). The last 46 nucleotides of this conserved region
were predicted to form a stable stem-loop structure 
thought to be critical for viral replication (Blight and 
Rice, 1997; Ito and Lai, 1997; Tsuchihara et al., 1997). 
The ORF encodes a large polypeptide precursor that is 
cleaved into at least 10 proteins by host and viral 
proteinases (Rice, 1996). The predicted envelope
proteins contain several conserved N-linked 
glycosylation sites and cysteine residues (Okamoto et 
al., 1992a). The NS3 gene encodes a serine protease and 
an RNA helicase and the NS5B gene encodes an RNA- 
dependent RNA polymerase. 

A remarkable characteristic of HCV is its 
genetic heterogeneity, which is manifested throughout 
the genome (Bukh et al., 1995). The most heterogeneous 
regions of the genome are found in the envelope genes, 
in particular the hypervariable region 1 (HVR1) at the 
N-terminus of E2 (Hijikata et al., 1991; Weiner et al., 
1991). HCV circulates as a quasispecies of closely
related genomes in an infected individual. Globally, 
six major HCV genotypes (genotypes 1-6) and multiple 
subtypes (a, b, c, etc.) have been identified (Bukh et
al., 1993; Simmonds et al., 1993). 

The nucleotide and deduced amino acid 
sequences among isolates within a quasispecies generally 
differ by < 2%, whereas those between isolates of 
different genotypes vary by as much as 35%. Genotypes
1, 2 and 3 are found worldwide and constitute more than
90% of the HCV infections in North and South America, 
Europe, Russia, China, Japan and Australia (Forns and 
Bukh, 1998). Throughout these regions genotype 1
accounts for the majority of HCV infections but
genotypes 2 and 3 each account for 5-15%. 
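
The percentage differences quoted above can be illustrated with a minimal sketch. It assumes two sequences that have already been aligned to the same length and simply counts mismatched positions, skipping gap characters. The function name percent_difference and the toy sequences are hypothetical and are not taken from any HCV isolate.

```python
def percent_difference(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned, ungapped positions at which two sequences differ.

    Both inputs are assumed to be pre-aligned (same length); positions where
    either sequence has a gap character '-' are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore insertion/deletion positions
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


# Toy example: one mismatch in ten compared positions.
print(percent_difference("ACGTACGTAC", "ACGTACGTAA"))  # 10.0
```

On real data the alignment itself would come from a dedicated alignment program; this sketch only covers the final counting step.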

At present, more than 80% of individuals 
infected with HCV become chronically infected and these 
chronically infected individuals have a relatively high
risk of developing chronic hepatitis, liver cirrhosis
and hepatocellular carcinoma (Hoofnagle, 1997). The
only effective therapy for chronic hepatitis C, 
interferon (IFN), alone or in combination with 
ribavirin, induces a sustained response in less than 50%
of treated patients (Davis et al., 1998; McHutchison et
al., 1998). Consequently, HCV is currently the most 
common cause of end stage liver failure and the reason 
for about 30% of liver transplants performed in the U.S.
(Hoofnagle, 1997). In addition, a number of recent
studies suggested that the severity of liver disease and 
the outcome of therapy may be genotype-dependent 
(reviewed in Bukh et al., 1997). In particular, these
studies suggested that infection with HCV genotype 1b
was associated with more severe liver disease (Brechot,
1997) and a poorer response to IFN therapy (Fried and
Hoofnagle, 1995). As a result of the inability to
develop a universally effective therapy against HCV 
infection, it is estimated that there are still more 
than 25,000 new infections yearly in the U.S. (Alter,
1997). Moreover, since there is no vaccine for HCV, HCV
remains a serious public health problem.

Despite the intense interest in the
development of vaccines and therapies for HCV, progress 
has been hindered by the absence of a useful cell 
culture system and the lack of any small animal model 
for laboratory study. For example, while replication of
HCV in several cell lines has been reported, such 
observations have turned out not to be highly 
reproducible. In addition, the chimpanzee is the only 
animal model, other than man, for this disease.
Consequently, HCV has been studied only by using
clinical materials obtained from patients or 
experimentally infected chimpanzees, an animal model 
whose availability is very limited.

However, several researchers have recently
reported the construction of infectious cDNA clones of 
HCV, the identification of which would permit a more 
effective search for susceptible cell lines and 
facilitate molecular analysis of the viral genes and
their function. For example, Yoo et al. (1995) and Dash
et al. (1997) reported that RNA transcripts from
cDNA clones of HCV-1 (genotype 1a) and HCV-N (genotype
1b), respectively, resulted in viral replication after
transfection into human hepatoma cell lines.
Unfortunately, the viability of these clones was not 
tested in vivo and concerns were raised about the 
infectivity of these cDNA clones in vitro (Fausto, 
1997). In addition, neither clone contained the
terminal 98 conserved nucleotides at the very 3' end of
the UTR. 

Kolykhalov et al. (1997) and Yanagi et al.
(1997, 1998) reported the derivation from HCV strains
H77 (genotype 1a) and HC-J4 (genotype 1b) of cDNA clones
of HCV that are infectious for chimpanzees. However,
while these infectious clones will aid in studying HCV 
replication and pathogenesis and will provide an 
important tool for development of in vitro replication 
and propagation systems, it is important to have
infectious clones of more than one genotype, given the 
extensive genetic heterogeneity of HCV and the potential 
impact of such heterogeneity on the development of 
effective therapies and vaccines for HCV. 

In addition, synthetic chimeric viruses can be 
used to map the functional regions of viruses with 
different phenotypes. In flaviviruses and pestiviruses, 
infectious chimeric viruses have been successfully 
engineered to express different functional units of
related viruses (Bray and Lai, 1991; Pletnev et al., 
1992, 1998; Vassilev et al., 1997) and in some cases it 
has been possible to make chimeras between non-related 
or distantly related viruses. For instance, the IRES
element of poliovirus or bovine viral diarrhea virus has
been replaced with IRES sequences from HCV (Frolov et 
al., 1998; Lu and Wimmer, 1996; Zhao et al., 1999). 
Recently, the construction of an infectious chimera of 
two closely related HCV subtypes has been reported. The
chimera contained the complete ORF of a genotype 1b
strain but had the 5' and 3' termini of a genotype 1a
strain (Yanagi et al., 1998). 

It is important to determine whether chimeras 
constructed from more divergent HCV strains are 
infectious because such chimeras could be used to define 
the functions of viral units and to dissect the immune 
response.

Summary Of The Invention

The present invention relates to nucleic acid 
sequence which comprises the genome of infectious 
hepatitis C virus and in particular, nucleic acid 
sequence which comprises the genome of infectious 
hepatitis C virus of genotype 2a. It is therefore an 
object of the invention to provide nucleic acid sequence 
which encodes infectious hepatitis C virus. Such 
nucleic acid sequence is referred to throughout the 
application as "infectious nucleic acid sequence". 

For the purposes of this application, nucleic 
acid sequence refers to RNA, DNA, cDNA or any variant 
thereof capable of directing host organism synthesis of
a hepatitis C virus polypeptide. It is understood that 
nucleic acid sequence encompasses nucleic acid 
sequences, which due to degeneracy, encode the same 
polypeptide sequence as the nucleic acid sequences 
described herein. 
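
The degeneracy referred to above can be illustrated with a short sketch: two different nucleotide sequences that translate to the same polypeptide. The codon table below is a small excerpt of the standard genetic code, and the four-residue peptide and both coding sequences are hypothetical toy examples rather than HCV sequences.

```python
# Excerpt of the standard genetic code (DNA codons); only the codons needed
# for the toy example are listed.
CODON_TABLE = {
    "ATG": "M",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
    "CTT": "L", "CTC": "L", "CTA": "L", "CTG": "L", "TTA": "L", "TTG": "L",
    "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S", "AGT": "S", "AGC": "S",
}


def translate(coding_sequence: str) -> str:
    """Translate a DNA or RNA coding sequence using the excerpted table."""
    dna = coding_sequence.upper().replace("U", "T")
    if len(dna) % 3 != 0:
        raise ValueError("length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


seq_1 = "ATGGCTCTTAGC"  # hypothetical coding sequence
seq_2 = "ATGGCGTTGTCA"  # different codons, same encoded residues
print(translate(seq_1), translate(seq_2))  # MALS MALS
```

The two sequences differ at five of twelve nucleotide positions yet encode the same polypeptide, which is the sense in which degenerate sequences fall within the definition above.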

The invention also relates to the use of the 
infectious nucleic acid sequences to produce chimeric 
genomes consisting of portions of the open reading 
frames of nucleic acid sequences of other genotypes 
(including, but not limited to, genotypes 1, 2, 3, 4, 5 
and 6) and subtypes (including, but not limited to, 
subtypes 1a, 1b, 2a, 2b, 2c, 3a, 4a-4f, 5a and 6a) of
HCV. For example, infectious nucleic acid sequence of
the 2a strain HC-J6 described herein can be used to
produce chimeras with sequences from the genomes of 
other strains of HCV from different genotypes or 
subtypes. Nucleic acid sequences which comprise 
sequences from two or more HCV genotypes or subtypes are 
designated "chimeric nucleic acid sequences". 



WO 00/75338 



PCT/US00/15446 




The invention further relates to mutations of 
the infectious nucleic acid sequence of the invention 
where mutation includes, but is not limited to, point 
mutations, deletions and insertions. In one embodiment, 
a gene or fragment thereof can be deleted to determine
the effect of the deleted gene or genes on the
properties of the encoded virus such as its virulence
and its ability to replicate. In an alternative
embodiment, a mutation may be introduced into the
infectious nucleic acid sequences to examine the effect
of the mutation on the properties of the virus. 

The invention also relates to the introduction 
of mutations or deletions into the infectious nucleic 
acid sequence in order to produce an attenuated
hepatitis C virus suitable for vaccine development. 

The invention further relates to the use of 
the infectious nucleic acid sequence to produce 
attenuated viruses via passage in vitro or in vivo of
the viruses produced by transfection of a host cell with
the infectious nucleic acid sequence. 

The present invention also relates to the use 
of the nucleic acid sequence of the invention or
fragments thereof in the production of polypeptides,
where "nucleic acid sequence of the invention" refers to
infectious nucleic acid sequence, mutations of 
infectious nucleic acid sequence, chimeric nucleic acid 
sequence and sequences which comprise the genome of 
attenuated viruses produced from the infectious nucleic 
acid sequence of the invention. In one embodiment, said 
polypeptide or polypeptides are fully or partially 
purified from hepatitis C virus produced by cells
transfected with nucleic acid sequence of the invention.
In another embodiment, the polypeptide or polypeptides
are produced recombinantly from a fragment of the 
nucleic acid sequences of the invention. In yet another 
embodiment, the polypeptides are chemically synthesized.

The polypeptides of the invention, especially
structural polypeptides, can serve as immunogens in the 
development of vaccines or as antigens in the 
development of diagnostic assays for detecting the 
presence of HCV in biological samples. 

The invention therefore also relates to 
vaccines for use in immunizing mammals especially humans 
against hepatitis C. In one embodiment, the vaccine 
comprises one or more polypeptides made from the nucleic 
acid sequence of the invention or fragment thereof. In
a second embodiment, the vaccine comprises a hepatitis C 
virus produced by transfection of host cells with the 
nucleic acid sequences of the invention. 

The present invention therefore relates to
methods for preventing hepatitis C in a mammal. In one
embodiment the method comprises administering to a 
mammal a polypeptide or polypeptides encoded by the 
nucleic acid sequence of the invention in an amount
effective to induce protective immunity to hepatitis C.
In another embodiment, the method of prevention 
comprises administering to a mammal a hepatitis C virus 
of the invention in an amount effective to induce 
protective immunity against hepatitis C. 

In yet another embodiment, the method of 
protection comprises administering to a mammal the 
nucleic acid sequence of the invention or a fragment 
thereof in an amount effective to induce protective
immunity against hepatitis C.

The invention also relates to hepatitis C
viruses produced by host cells transfected with the 
nucleic acid sequence of the present invention. 

The invention therefore also provides 
pharmaceutical compositions comprising the nucleic acid
sequence of the invention and/or the encoded hepatitis C 
viruses. The invention further provides pharmaceutical 
compositions comprising polypeptides encoded by the 
nucleic acid sequence of the invention or fragments
thereof. The pharmaceutical compositions of the
invention may be used prophylactically or 
therapeutically. 

The invention also relates to antibodies to 
the hepatitis C virus of the invention or their encoded
polypeptides and to pharmaceutical compositions 
comprising these antibodies. 

The invention also relates to the use of the 
nucleic acid sequences of the invention to identify cell
lines capable of supporting the replication of HCV in
vitro.

The invention further relates to the use of 
the nucleic acid sequences of the invention or their 
encoded viral enzymes (e.g. NS3 serine protease, NS3
helicase, NS5B RNA polymerase) to develop screening 
assays to identify antiviral agents for HCV. 

Brief Description Of Figures 

Figure 1 shows the amplification and cloning 
of hepatitis C virus genotype 2a (strain HC-J6CH). The
nucleotide positions correspond to the sequence of
pJ6CF, a full-length cDNA clone of hepatitis C virus,
genotype 2a, strain HC-J6CH. Products from polymerase
chain reaction are also shown. The names of the clones
obtained from these products are indicated (numbers of
clones sequenced are shown in parentheses). The
composition of the full-length cDNA clone is shown at
the bottom. The restriction enzymes used for cloning
are indicated. An XbaI site in HC-J6CH was eliminated by
a silent substitution at position 5494.

Figure 2 shows tree analysis of clones 
amplified from an infectious acute phase plasma pool
generated in a chimpanzee inoculated with human plasma
containing strain HC-J6 (Okamoto et al., 1991) as well
as a tree of the predicted polyprotein sequence of
HC-J6CH and the infectious HC-J6CH cDNA clone (pJ6CF).
The nucleotide positions with deletions or insertions
were stripped in the analysis of the clones. Multiple
sequence alignments and tree analyses were performed
with GeneWorks (Oxford Molecular Group) (Bukh et al.,
1995). Genotype designations are indicated. Other
sequences included in the analysis are HC-J8 (Okamoto et
al., 1992), genotype 1a infectious clone BEBE1 (Nakao et
al., 1996), H77C (Yanagi et al., 1997); genotype 1b
infectious clone J4L6S (Yanagi et al., 1998). The scale
in each tree indicates the calculated genetic distance.

Figure 3 shows the alignment of the
hypervariable region 1 sequences from 8 J6S clones of
strain HC-J6CH. HC-J6CH represents the consensus amino
acid sequence of the infectious plasma pool from an
experimentally infected chimpanzee. HC-J6 is the
published amino acid sequence of the original inoculum
(Okamoto et al., 1991).

Figure 4 shows the construction of four 
intertypic chimeric cDNA clones. White boxes are
sequences derived from genotype 2a clone pJ6CF and
black boxes are sequences derived from genotype 1a clone
pCV-H77C (Yanagi et al., 1997). An NdeI site (mutation
at position 9158 of pCV-H77C) was eliminated and an
artificial NdeI site (mutation at position 2765 of
pCV-H77C) was created by site-directed mutagenesis;
silent mutations are underlined.

Figures 5A and 5B show the alignment of the 
nucleotide sequences of the 5' (Fig. 5A) and 3' UTRs
(Fig. 5B) and the amino acid sequences of E2/p7/NS2
junctions (Fig. 5B) in the intertypic 1a, 2a chimeric
cDNA clones. In the 5' UTR alignment, the first 39 nts
of core believed to be important for the IRES function
were included (Lemon and Honda, 1997). Top line: the
sequence of the infectious genotype 1a clone pCV-H77C
(Yanagi et al., 1997). Bottom line: the sequence of the
infectious genotype 2a clone pJ6CF. Dot: identity with
the sequence of H77C. Capital letter: different from the
sequence of H77C. Dash: deletion. Bold face: initiation
or stop codon of the ORF. Underlined: AgeI cleavage
site. Arrow: putative sites in the HCV polyprotein
cleaved by host signal peptidases. Numbering
corresponds to the sequence of pCV-H77C.

Figures 6A-6F show the nucleotide sequence of
the infectious hepatitis C virus clone of genotype 1a
strain H77C and Figures 6G-6H show the amino acid 
sequence encoded by the clone. 

Figures 7A-7F show the nucleotide sequence of
the infectious hepatitis C virus clone of genotype 1b
strain HC-J4 and Figures 7G-H show the amino acid 
sequence encoded by the clone. 

DESCRIPTION OF THE INVENTION

The present invention relates to nucleic acid 
sequence which comprises the genome of an infectious 
hepatitis C virus. More specifically, the invention 
relates to nucleic acid sequence which encodes 
infectious hepatitis C virus of strain HC-J6CH, genotype
2a. The infectious nucleic acid sequence of the
invention is shown in SEQ ID NO:1 and is contained in a
plasmid construct deposited with the American Type
Culture Collection (ATCC) on May 28, 1999 and having
ATCC accession number PTA-153. 

The invention also relates to "chimeric 
nucleic acid sequences" where the chimeric nucleic acid 
sequences consist of open-reading frame sequences and/or 
5' and/or 3' untranslated sequences taken from nucleic
acid sequences of hepatitis C viruses of different 
genotypes or subtypes. 

In one embodiment, the chimeric nucleic acid
sequence consists of sequence from the genome of 
infectious HCV of genotype 2a which encodes structural 
polypeptides and sequence from the genome of a HCV of a 
different genotype or subtype which encodes 
nonstructural polypeptides. 

Alternatively, the nonstructural region of 
infectious HCV of genotype 2a and structural region of a 
HCV of a different genotype or subtype may be combined. 
This will result in a chimeric nucleic acid sequence
consisting of sequence from the genome of infectious HCV
of genotype 2a which encodes nonstructural polypeptides
and sequence from the genome of an HCV of another
genotype or subtype which encodes structural
polypeptides.
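
The two arrangements described above (structural region from one genome with nonstructural region from another, or the reverse) can be sketched in a few lines. This is only an illustration under stated assumptions: the Genome record, the make_chimera helper, and the toy sequences are hypothetical, the untranslated regions are arbitrarily taken from the backbone genome, and the sketch ignores practical requirements such as preserving the reading frame and polyprotein cleavage sites at the junction.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Genome:
    """Hypothetical four-part layout: 5' UTR, structural, nonstructural, 3' UTR."""
    name: str
    utr5: str
    structural: str
    nonstructural: str
    utr3: str

    def sequence(self) -> str:
        return self.utr5 + self.structural + self.nonstructural + self.utr3


def make_chimera(backbone: Genome, donor: Genome, region: str) -> Genome:
    """Return a copy of `backbone` with one coding region taken from `donor`."""
    if region not in ("structural", "nonstructural"):
        raise ValueError("region must be 'structural' or 'nonstructural'")
    return replace(
        backbone,
        name=f"{backbone.name}/{donor.name}({region})",
        **{region: getattr(donor, region)},
    )


# Toy sequences only; these are placeholders, not HCV sequence data.
genome_2a = Genome("2a", "aa", "CCCC", "GGGGGG", "tt")
genome_other = Genome("1a", "ac", "TTTT", "AAAAAA", "ca")

# Structural region from 2a, nonstructural region from the other genotype.
print(make_chimera(genome_2a, genome_other, "nonstructural").sequence())
# Nonstructural region from 2a, structural region from the other genotype.
print(make_chimera(genome_2a, genome_other, "structural").sequence())
```
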
Preferably, the nucleic acid sequence from the
genome of the infectious HCV clone of genotype 1a
(deposited with the ATCC on June 2, 1999; Figures
6A-6F), or the nucleic acid sequence from the genome of
the infectious HCV clone of genotype 1b (ATCC accession
number 209596; Figures 7A-7F) is used to construct the
chimeric nucleic acid sequence with the HCV of genotype 
2a of the invention. 

It is believed that the construction of such
chimeric nucleic acid sequences will be of importance in
studying the growth and virulence properties of 
hepatitis C virus and in the production of candidate 
hepatitis C virus vaccines suitable to confer protection 
against multiple genotypes of HCV. For example, one
might produce a "multivalent" vaccine by putting 
epitopes from several genotypes or subtypes into one 
clone. Alternatively one might replace just a single 
gene from an infectious sequence with the corresponding
gene from the genomic sequence of a strain from another
genotype or subtype or create a chimeric gene which 
contains portions of a gene from two genotypes or 
subtypes. Examples of genes which could be replaced or
which could be made chimeric include, but are not
limited to, the E1, E2 and NS4 genes.

The invention further relates to mutations of 
the infectious nucleic acid sequences where "mutations" 
include, but are not limited to, point mutations, 
deletions and insertions. Of course, one of ordinary 
skill in the art would recognize that the size of the 
insertions would be limited by the ability of the 
resultant nucleic acid sequence to be properly packaged
within the virion. Such mutations could be produced by
techniques known to those of skill in the art such as
site-directed mutagenesis, fusion PCR, and restriction 
digestion followed by religation. 
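
The three classes of mutation named above can be illustrated at the level of the sequence string. The sketch below is purely in silico and says nothing about the laboratory techniques themselves; the helper names, the 1-based coordinate convention, and the toy sequence are assumptions made for illustration.

```python
def point_mutation(seq: str, position: int, new_base: str) -> str:
    """Replace the base at a 1-based position with `new_base`."""
    if not 1 <= position <= len(seq) or len(new_base) != 1:
        raise ValueError("position out of range or new_base not a single base")
    return seq[:position - 1] + new_base + seq[position:]


def delete_region(seq: str, start: int, end: int) -> str:
    """Delete the bases from `start` to `end` inclusive (1-based)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("invalid region")
    return seq[:start - 1] + seq[end:]


def insert_after(seq: str, position: int, insertion: str) -> str:
    """Insert `insertion` immediately after a 1-based position (0 = at start)."""
    if not 0 <= position <= len(seq):
        raise ValueError("position out of range")
    return seq[:position] + insertion + seq[position:]


toy = "ACGTACGT"  # placeholder sequence
print(point_mutation(toy, 3, "T"))   # ACTTACGT
print(delete_region(toy, 3, 5))      # ACCGT
print(insert_after(toy, 4, "GGG"))   # ACGTGGGACGT
```
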

In one embodiment, mutagenesis might be 
undertaken to determine sequences that are important for 
viral properties such as replication or virulence. For 
example, one may introduce a mutation into the 
infectious nucleic acid sequence which eliminates the 
cleavage site between the NS4A and NS4B polypeptides to 
examine the effects on viral replication and processing 
of the polypeptide. 

Alternatively, one may delete all or part of a 
gene or of the 5' or 3' nontranslated region contained in 
an infectious nucleic acid sequence and then transfect a
host cell (animal or cell culture) with the mutated 
sequence and measure viral replication in the host by 
methods known in the art such as RT-PCR. Preferred 
genes include, but are not limited to, the p7, NS4B and
NS5A genes. Of course, those of ordinary skill in the 
art will understand that deletion of part of a gene, 
preferably the central portion of the gene, may be 
preferable to deletion of the entire gene in order to 
conserve the cleavage site boundaries which exist
between proteins in the HCV polyprotein and which are 
necessary for proper processing of the polyprotein. 

In the alternative, if the transfection is 
into a host animal such as a chimpanzee, one can monitor 
the virulence phenotype of the virus produced by 
transfection of the mutated infectious nucleic acid 
sequence by methods known in the art such as measurement 
of liver enzyme levels (alanine aminotransferase (ALT) 
or isocitrate dehydrogenase (ICD)) or by histopathology
of liver biopsies. Thus, mutations of the infectious
nucleic acid sequences may be useful in the production 
of attenuated HCV strains suitable for vaccine use. 

The invention also relates to the use of the 
infectious nucleic acid sequence of the present
invention to produce attenuated viral strains via 
passage in vitro or in vivo of the virus produced by 
transfection with the infectious nucleic acid sequence. 

The present invention therefore relates to the
use of the nucleic acid sequence of the invention to
identify cell lines capable of supporting the 
replication of HCV. 

In particular, it is contemplated that the 
mutations of the infectious nucleic acid sequence of the
invention and the production of chimeric sequences as 
discussed above may be useful in identifying sequences 
critical for cell culture adaptation of HCV and hence, 
may be useful in identifying cell lines capable of
supporting HCV replication.

Transfection of tissue culture cells with the
nucleic acid sequences of the invention may be done by
methods of transfection known in the art such as
electroporation, precipitation with DEAE-Dextran or
calcium phosphate or liposomes.

In one such embodiment, the method comprises
the growing of animal cells, especially human cells, in
vitro and transfecting the cells with the nucleic acid
of the invention, then determining if the cells show
indicia of HCV infection. Such indicia include the
detection of viral antigens in the cell, for example, by
immunofluorescence procedures well known in the art; the
detection of viral polypeptides by Western blotting
using antibodies specific therefor; and the detection of
newly transcribed viral RNA within the cells via methods 
such as RT-PCR. The presence of live, infectious virus 
particles following such tests may also be shown by 
injection of cell culture medium or cell lysates into
healthy, susceptible animals, with subsequent exhibition 
of the signs and symptoms of HCV infection. 

Suitable cells or cell lines for culturing HCV 
include, but are not limited to, lymphocyte and
hepatocyte cell lines known in the art.

Alternatively, primary hepatocytes can be 
cultured, and then infected with HCV; or, the hepatocyte 
cultures could be derived from the livers of infected 
chimpanzees. In addition, various immortalization
methods known to those of ordinary skill in the art can 
be used to obtain cell lines derived from hepatocyte 
cultures. For example, primary hepatocyte cultures may 
be fused to a variety of cells to maintain stability. 

The present invention further relates to the 
in vitro and in vivo production of hepatitis C viruses 
from the nucleic acid sequences of the invention. 

In one embodiment, the sequences of the 
invention can be inserted into an expression vector that
functions in eukaryotic cells. Eukaryotic expression 
vectors suitable for producing high efficiency gene 
transfer in vivo are well known to those of ordinary 
skill in the art and include, but are not limited to,
plasmids, vaccinia viruses, retroviruses, adenoviruses
and adeno-associated viruses. 

In another embodiment, the sequences contained 
in the recombinant expression vector can be transcribed 
in vitro by methods known to those of ordinary skill in
the art in order to produce RNA transcripts which encode
the hepatitis C viruses of the invention. The hepatitis 
C viruses of the invention may then be produced by 
transfecting cells by methods known to those of ordinary 
skill in the art with either the in vitro transcription
mixture containing the RNA transcripts or with the 
recombinant expression vectors containing the nucleic 
acid sequences described herein. 

The hepatitis C viruses produced from the
sequences of the invention may be purified or partially
purified from the transfected cells by methods known to 
those of ordinary skill in the art. In a preferred 
embodiment, the viruses are partially purified prior to 
their use as immunogens in the pharmaceutical
compositions and vaccines of the present invention. 

The present invention therefore relates to the 
use of the hepatitis C viruses produced from the nucleic 
acid sequences of the invention as immunogens in live or
killed (e.g., formalin inactivated) vaccines to prevent
hepatitis C in a mammal. 

In an alternative embodiment, the immunogen of 
the present invention may be an infectious nucleic acid 
sequence, a chimeric nucleic acid sequence, or a mutated
infectious nucleic acid sequence which encodes a 
hepatitis C virus. Where the sequence is a cDNA 
sequence, the cDNAs and their RNA transcripts may be 
used to transfect a mammal by direct injection into the
liver tissue of the mammal as described in the Examples.

Alternatively, direct gene transfer may be 
accomplished via administration of a eukaryotic 
expression vector containing a nucleic acid sequence of 
the invention.

In yet another embodiment, the immunogen may
be a polypeptide encoded by the nucleic acid sequences 
of the invention. The present invention therefore also 
relates to polypeptides produced from the nucleic acid 
sequences of the invention or fragments thereof. In one 
embodiment, polypeptides of the present invention can be 
recombinantly produced by synthesis from the nucleic 
acid sequences of the invention or isolated fragments 
thereof, and purified, or partially purified, from 
transfected cells using methods already known in the 
art. In an alternative embodiment, the polypeptides may 
be purified or partially purified from viral particles 
produced via transfection of a host cell with the 
nucleic acid sequences of the invention. Such
polypeptides might, for example, include either capsid 
or envelope polypeptides prepared from the sequences of 
the present invention. 

When used as immunogens, the nucleic acid 
sequences of the invention, or the polypeptides or 
viruses produced therefrom, are preferably partially 
purified prior to use as immunogens in pharmaceutical 
compositions and vaccines of the present invention. 
When used as a vaccine, the sequences and the
polypeptide and virus products thereof, can be 
administered alone or in a suitable diluent, including, 
but not limited to, water, saline, or some type of 
buffered medium. The vaccine according to the present 
invention may be administered to an animal, especially a 
mammal, and most especially a human, by a variety of 
routes, including, but not limited to, intradermally, 
intramuscularly, subcutaneously, or in any combination 
thereof.

Suitable amounts of material to administer for
prophylactic and therapeutic purposes will vary 
depending on the route selected and the immunogen 
(nucleic acid, virus, polypeptide) administered. One 
skilled in the art will appreciate that the amounts to
be administered for any particular treatment protocol 
can be readily determined without undue experimentation. 
The vaccines of the present invention may be 
administered once or periodically until a suitable titer
of anti-HCV antibodies appears in the blood. For an
immunogen consisting of a nucleic acid sequence, a 
suitable amount of nucleic acid sequence to be used for 
prophylactic purposes might be expected to fall in the
range of from about 100 µg to about 5 mg and most
preferably in the range of from about 500 µg to about
2 mg. For a polypeptide, a suitable amount to use for
prophylactic purposes is preferably 100 ng to 100 µg and
for a virus 10^2 to 10^6 infectious doses. Such
administration will, of course, occur prior to any sign 
of HCV infection. 

A vaccine of the present invention may be
employed in such forms as capsules, liquid solutions,
suspensions or elixirs for oral administration, or
sterile liquid forms such as solutions or suspensions.
An inert carrier is preferably used, such as saline or
phosphate-buffered saline, or any such carrier in which
the HCV of the present invention can be suitably
suspended. The vaccines may be in the form of single
dose preparations or in multi-dose flasks which can be
utilized for mass-vaccination programs of both animals
and humans. For purposes of using the vaccines of the
present invention reference is made to Remington's
Pharmaceutical Sciences, Mack Publishing Co., Easton,
Pa., Osol (Ed.) (1980); and New Trends and Developments
in Vaccines, Voller et al. (Eds.), University Park
Press, Baltimore, Md. (1978), both of which provide much 
5 useful information for preparing and using vaccines. Of 
course, the polypeptides of the present invention, when 
used as vaccines, can include, as part of the 
composition or emulsion, a suitable adjuvant, such as 
alum (or aluminum hydroxide) when humans are to be 

10 

vaccinated, to further stimulate production of 
antibodies by immune cells. When nucleic acids, viruses 
or polypeptides are used for vaccination purposes, other 
specific adjuvants such as CpG motifs (Krieg, A.K. et 
15 al.(1995) and (1996)), may prove useful. 

When the nucleic acids, viruses and
polypeptides of the present invention are used as
vaccines or inocula, they will normally exist as
physically discrete units suitable as a unitary dosage
for animals, especially mammals, and most especially
humans, wherein each unit will contain a predetermined
quantity of active material calculated to produce the
desired immunogenic effect in association with the
required diluent. The dose of said vaccine or inoculum
according to the present invention is administered at
least once. In order to increase the antibody level, a
second or booster dose may be administered at some time
after the initial dose. The need for, and timing of,
such booster dose will, of course, be determined within
the sound judgment of the administrator of such vaccine
or inoculum and according to sound principles well known
in the art. For example, such booster dose could
reasonably be expected to be advantageous at some time
between about 2 weeks and about 6 months following the
initial vaccination. Subsequent doses may be
administered as indicated.

The nucleic acid sequences, viruses and
polypeptides of the present invention can also be
administered for purposes of therapy, where a mammal,
especially a primate, and most especially a human, is
already infected, as shown by well known diagnostic
measures. When the nucleic acid sequences, viruses or
polypeptides of the present invention are used for such
therapeutic purposes, many of the same criteria will
apply as when they are used as vaccines, except that
inoculation will occur post-infection. Thus, when the
nucleic acid sequences, viruses or polypeptides of the
present invention are used as therapeutic agents in the
treatment of infection, the therapeutic agent comprises
a pharmaceutical composition containing a sufficient
amount of said nucleic acid sequences, viruses or
polypeptides so as to elicit a therapeutically effective
response in the organism to be treated. Of course, the
amount of pharmaceutical composition to be administered
will, as for vaccines, vary depending on the immunogen
contained therein (nucleic acid, polypeptide, virus) and
on the route of administration.

The therapeutic agent according to the present
invention can thus be administered by subcutaneous,
intramuscular or intradermal routes. One skilled in the
art will certainly appreciate that the amounts to be
administered for any particular treatment protocol can
be readily determined without undue experimentation. Of
course, the actual amounts will vary depending on the
route of administration as well as the sex, age, and
clinical status of the subject which, in the case of
human patients, is to be determined with the sound
judgment of the clinician.

The therapeutic agent of the present invention
can be employed in such forms as capsules, liquid
solutions, suspensions or elixirs, or sterile liquid
forms such as solutions or suspensions. An inert carrier
is preferably used, such as saline, phosphate-buffered
saline, or any such carrier in which the HCV of the
present invention can be suitably suspended. The
therapeutic agents may be in the form of single dose
preparations or in multi-dose flasks which can be
utilized for mass-treatment programs of both animals and
humans. Of course, when the nucleic acid sequences,
viruses or polypeptides of the present invention are
used as therapeutic agents they may be administered as a
single dose or as a series of doses, depending on the
situation as determined by the person conducting the
treatment.

The nucleic acids, polypeptides and viruses of
the present invention can also be utilized in the
production of antibodies against HCV. The term
"antibody" is herein used to refer to immunoglobulin
molecules and immunologically active portions of
immunoglobulin molecules. Examples of antibody
molecules are intact immunoglobulin molecules,
substantially intact immunoglobulin molecules and
portions of an immunoglobulin molecule, including those
portions known in the art as Fab, F(ab')2 and F(v) as
well as chimeric antibody molecules.

Thus, the polypeptides, viruses and nucleic 
acid sequences of the present invention can be used in
the generation of antibodies that immunoreact (i.e.,
specific binding between an antigenic determinant- 
containing molecule and a molecule containing an 
antibody combining site such as a whole antibody 
molecule or an active portion thereof) with antigenic 
determinants on the surface of hepatitis C virus 
particles. 

The present invention therefore also relates 
to antibodies produced following immunization with the 
nucleic acid sequences, viruses or polypeptides of the 
present invention. These antibodies are typically 
produced by immunizing a mammal with an immunogen or 
vaccine, to induce antibody molecules having 
immunospecificity for polypeptides or viruses produced
in response to infection with the nucleic acid sequences
of the present invention. When used in generating such
antibodies, the nucleic acid sequences, viruses, or
polypeptides of the present invention may be linked to 
some type of carrier molecule. The resulting antibody 
molecules are then collected from said mammal. 
Antibodies produced according to the present invention 
have the unique advantage of being generated in response 
to authentic, functional polypeptides produced according 
to the actual cloned HCV genome. 

The antibody molecules of the present 
invention may be polyclonal or monoclonal. Monoclonal 
antibodies are readily produced by methods well known in 
the art. Portions of immunoglobulin molecules, such as
Fabs, as well as chimeric antibodies, may also be 
produced by methods well known to those of ordinary 
skill in the art of generating such antibodies.

The antibodies according to the present
invention may also be contained in blood, plasma, serum, 
hybridoma supernatants, and the like. Alternatively, 
the antibody of the present invention is isolated to the 
extent desired by well known techniques such as, for 
example, using DEAE Sephadex. The antibodies produced 
according to the present invention may be further 
purified so as to obtain specific classes or subclasses 
of antibody such as IgM, IgG, IgA, and the like. 
Antibodies of the IgG class are preferred for purposes 
of passive protection. 

The antibodies of the present invention are 
useful in the prevention and treatment of diseases 
caused by hepatitis C virus in animals, especially
mammals, and most especially humans. 

In providing the antibodies of the present 
invention to a recipient mammal, preferably a human, the 
dosage of administered antibodies will vary depending on 
such factors as the mammal's age, weight, height, sex, 
general medical condition, previous medical history, and 
the like. 

In general, it will be advantageous to provide
the recipient mammal with a dosage of antibodies in the
range of from about 1 mg/kg body weight to about 10
mg/kg body weight of the mammal, although a lower or
higher dose may be administered if found desirable.
Such antibodies will normally be administered by
intravenous or intramuscular route as an inoculum. The
antibodies of the present invention are intended to be
provided to the recipient subject in an amount
sufficient to prevent, lessen or attenuate the severity,
extent or duration of any existing infection.

The antibodies prepared by use of the nucleic
acid sequences, viruses or polypeptides of the present
invention are also highly useful for diagnostic
purposes. For example, the antibodies can be used as in
vitro diagnostic agents to test for the presence of HCV
in biological samples taken from animals, especially
humans. Such assays include, but are not limited to,
radioimmunoassays, EIA, fluorescence, Western blot
analysis and ELISAs. In one such embodiment, the
biological sample is contacted with antibodies of the
present invention and a labeled second antibody is used
to detect the presence of HCV to which the antibodies
are bound.

Such assays may be, for example, direct where
the labeled first antibody is immunoreactive with the
antigen, such as, for example, a polypeptide on the
surface of the virus; indirect where a labeled second
antibody is reactive with the first antibody; a
competitive protocol such as would involve the addition
of a labeled antigen; or sandwich where both labeled and
unlabeled antibody are used, as well as other protocols
well known and described in the art.

In one embodiment, an immunoassay method would
utilize an antibody specific for HCV envelope
determinants and would further comprise the steps of
contacting a biological sample with the HCV-specific
antibody and then detecting the presence of HCV material
in the test sample using one of the types of assay
protocols as described above. Polypeptides and
antibodies produced according to the present invention
may also be supplied in the form of a kit, either
present in vials as purified material, or present in
compositions and suspended in suitable diluents as
previously described.

In a preferred embodiment, such a diagnostic
test kit for detection of HCV antigens in a test sample
comprises in combination a series of containers, each
container holding a reagent needed for such assay. Thus,
one such container would contain a specific amount of
HCV-specific antibody as already described, a second
container would contain a diluent for suspension of the
sample to be tested, a third container would contain a
positive control and an additional container would
contain a negative control. An additional container
could contain a blank.

For all prophylactic, therapeutic and
diagnostic uses, the antibodies of the invention and
other reagents, plus appropriate devices and
accessories, may be provided in the form of a kit so as
to facilitate ready availability and ease of use.

The present invention also relates to the use
of nucleic acid sequences and polypeptides of the
present invention to screen potential antiviral agents
for antiviral activity against HCV. Such screening
methods are known by those of skill in the art.
Generally, the antiviral agents are tested at a variety
of concentrations for their effect on preventing viral
replication in cell culture systems which support viral
replication, and then for an inhibition of infectivity
or of viral pathogenicity (and a low level of toxicity)
in an animal model system.

In one embodiment, animal cells (especially
human cells) transfected with the nucleic acid sequences
of the invention are cultured in vitro and the cells are
treated with a candidate antiviral agent (a chemical,
peptide etc.) by adding the candidate agent to the
medium. The treated cells are then exposed, possibly
under transfecting or fusing conditions known in the
art, to the nucleic acid sequences of the present
invention. A sufficient period of time would then be
allowed to pass for infection to occur, following which
the presence or absence of viral replication would be
determined versus untreated control cells by methods
known to those of ordinary skill in the art. Such
methods include, but are not limited to, the detection
of viral antigens in the cell, for example, by
immunofluorescence procedures well known in the art; the
detection of viral polypeptides by Western blotting
using antibodies specific therefor; the detection of
newly transcribed viral RNA within the cells by RT-PCR;
and the detection of the presence of live, infectious
virus particles by injection of cell culture medium or
cell lysates into healthy, susceptible animals, with
subsequent exhibition of the signs and symptoms of HCV
infection. A comparison of results obtained for control
cells (treated only with nucleic acid sequence) with
those obtained for treated cells (nucleic acid sequence
and antiviral agent) would indicate the degree, if any,
of antiviral activity of the candidate antiviral agent.
Of course, one of ordinary skill in the art would
readily understand that such cells can be treated with
the candidate antiviral agent either before or after
exposure to the nucleic acid sequence of the present
invention so as to determine what stage, or stages, of
viral infection and replication said agent is effective
against.
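
The comparison between control and treated cells described above can be made concrete with a small calculation. The following is a minimal sketch, assuming the readout is a viral RNA quantity (for example from RT-PCR) measured in the same arbitrary units in both cultures. The function name, the example values and the choice of percent inhibition and log10 reduction as summary figures are illustrative assumptions and are not part of the described method.

```python
import math


def antiviral_effect(control_signal: float, treated_signal: float) -> dict:
    """Compare a replication readout between control and treated cells.

    control_signal: readout for cells exposed to nucleic acid only.
    treated_signal: readout for cells exposed to nucleic acid plus agent.
    Both are hypothetical measurements in the same arbitrary units.
    """
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    if treated_signal < 0:
        raise ValueError("treated signal cannot be negative")

    percent_inhibition = 100.0 * (1.0 - treated_signal / control_signal)
    # The log10 reduction is unbounded when nothing is detected after treatment.
    if treated_signal > 0:
        log10_reduction = math.log10(control_signal / treated_signal)
    else:
        log10_reduction = math.inf
    return {
        "percent_inhibition": percent_inhibition,
        "log10_reduction": log10_reduction,
    }


if __name__ == "__main__":
    # Invented example: 1e6 units in control cells, 1e4 units in treated cells.
    print(antiviral_effect(1e6, 1e4))
```

Running the same calculation for cells treated before and after exposure to the nucleic acid would give one figure per treatment schedule, which is the comparison the paragraph describes.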




In an alternative embodiment, viral enzymes
such as NS3 protease, NS2-NS3 protease, NS3 helicase or
NS5B RNA polymerase may be produced from a nucleic acid
sequence of the invention and used to screen for
inhibitors which may act as antiviral agents. The
structural and nonstructural regions of the HCV genome,
including nucleotide and amino acid locations, have been
determined, for example, as depicted in Houghton, M.
(1996), Fig. 1; and Major, M.E. et al. (1997), Table 2.

Such above-mentioned protease inhibitors may
take the form of chemical compounds or peptides which
mimic the known cleavage sites of the protease and may
be screened using methods known to those of skill in the
art (Houghton, M. (1996) and Major, M.E. et al. (1997)).
For example, a substrate may be employed which mimics
the protease's natural substrate, but which provides a
detectable signal (e.g. by fluorimetric or colorimetric
methods) when cleaved. This substrate is then incubated
with the protease and the candidate protease inhibitor
under conditions of suitable pH, temperature etc. to
detect protease activity. The proteolytic activities of
the protease in the presence or absence of the candidate
inhibitor are then determined.
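
One way to compare proteolytic activity with and without a candidate inhibitor is to estimate the rate of signal increase in each reaction and take the ratio. The sketch below assumes a signal (fluorimetric or colorimetric) read at several time points during the linear phase of the reaction. The least-squares slope as a rate estimate, the function names and the example readings are assumptions made for illustration and are not prescribed by the text.

```python
def slope(times, signals):
    """Least-squares slope of signal versus time, used as a rate estimate."""
    n = len(times)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two paired time/signal points")
    mean_t = sum(times) / n
    mean_s = sum(signals) / n
    numerator = sum((t - mean_t) * (s - mean_s) for t, s in zip(times, signals))
    denominator = sum((t - mean_t) ** 2 for t in times)
    if denominator == 0:
        raise ValueError("time points must not all be identical")
    return numerator / denominator


def fractional_activity(times, without_inhibitor, with_inhibitor):
    """Rate with inhibitor divided by rate without inhibitor."""
    uninhibited_rate = slope(times, without_inhibitor)
    if uninhibited_rate <= 0:
        raise ValueError("no measurable activity in the uninhibited reaction")
    return slope(times, with_inhibitor) / uninhibited_rate


if __name__ == "__main__":
    # Invented readings (arbitrary signal units) at 0, 1, 2 and 3 minutes.
    minutes = [0, 1, 2, 3]
    activity = fractional_activity(minutes, [0, 10, 20, 30], [0, 2, 4, 6])
    print(f"remaining activity: {activity:.2f}")
```

A value near 1 would indicate little effect of the candidate compound, and a value near 0 would indicate strong inhibition under the conditions tested.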

In yet another embodiment, a candidate
antiviral agent (such as a protease inhibitor) may be
directly assayed in vivo for antiviral activity by
administering the candidate antiviral agent to a
chimpanzee transfected with a nucleic acid sequence of
the invention or infected with a virus of the invention
and then measuring viral replication in vivo via methods
such as RT-PCR. Of course, the chimpanzee may be
treated with the candidate agent either before or after
transfection with the infectious nucleic acid sequence
or infection with a virus of the invention so as to
determine what stage, or stages, of viral infection and
replication the agent is effective against.

The invention also provides that the nucleic 
acid sequences, viruses and polypeptides of the 
invention may be supplied in the form of a kit, alone or 
in the form of a pharmaceutical composition. 

All scientific publications and/or patents
cited herein are specifically incorporated by reference.
The following examples illustrate various aspects of the
invention but are in no way intended to limit the scope
thereof.

EXAMPLES 
Materials and Methods 

Source of HCV 

An infectious plasma pool of HCV genotype 2a
(HC-J6CH), prepared from acute phase plasma of a
chimpanzee experimentally inoculated with plasma from a
Japanese patient infected with strain HC-J6 (Okamoto et
al., 1991), was used for cloning. An infectious cDNA
clone of HCV strain H77, genotype 1a, was also used
(pCV-H77C; Yanagi et al., 1997).

Amplification, cloning and sequence analysis 

Viral RNA was extracted from 100 µl aliquots
of the HC-J6CH plasma pool with the TRIzol system
(GIBCO/BRL) (Yanagi et al., 1997). Primers used in cDNA
synthesis and PCR amplification were based on the
genomic sequence of strain HC-J6 (Okamoto et al., 1991)
and on the conserved region (3'X) of the 3' UTR of HCV
genotype 2a (Tanaka et al., 1996) (Table 1). The RNA
was denatured at 65°C for 2 min, and cDNA was
synthesized at 42°C for 1 hour with Superscript II
reverse transcriptase (GIBCO/BRL) and specific reverse
primers in 20 µl reaction volumes. The cDNA mixtures
were treated with RNase H and RNase T1 (GIBCO/BRL) at
37°C for 20 min.

TABLE 1

Oligonucleotides used for amplification and cloning
of strain HC-J6CH, genotype 2a

Designation     Sequence (5' → 3') a

2427S-H77       ACTGGACACGGAGGTGGCCGCGTC
2462S-H77       TTGTTCTTGTCGGGTTAATGGCGC
2645R-H77       GGGTGTACTACACACATGAGTAAG
2832R-H77       AAGCGCCCCTAACTGATGATG
H2751SII        CGTCATCGATACCTCAGCGGGCATATGCACTGGACACGGA
H2786R          GTCCAGTGCATATGCCCGCTGAGG
H2870R          CATGCACCAGCTGATATAGCGCTTGTAATATG
H7851S          TCCGTAGAGGAAGCTTGCAGCCTGACGCCC
H9140S(M)       CAGAGGAGGCAGGGTGCTATATGTGGCAAGTAC
H9173R(M)       GTACTTGCCACATATAGCAGCCCTGCCTCCTCTG
H9417R          CGTCTCTAGACAGGAAATGGCTTAAGAGGCCGGAGTGTTTACC
J6-H2556S       TTATGGATGCTCATCTTGTTGGGCCAGGCCGAAGCAGCTTTGGAGAACCTCGTAATACT
                CAATGC
356RF-J6H77     AGGATTTGTGCTCATGGTGCACGGTCTACGAG
1S-J6F b        [sequence not recoverable]
333S-J6         CCGTGCACCATGAGCACAAATCCTAAACCTC
753R-J6         GGATGTACCCCATGAGGTCGGCAAAG
2543S-J6F       GTTTGCGCCTGCTTATGGATGCTCATCTTG
2787R-J6(26)    GCGTCATAAGCATATGCCTGTTGGGG
3329R-J6        CCCTCAGCACTGGAGTACATCTG
5487S-J6F       CGTCATGCATACCCCTAGGGCGGCTCTCATTGAAGAGGG
5518R-J6F       CGTCCCCTCTTCAATGAGAGCCGCTCTAGA
9251S-J6F       GCGGTGAAGACCAAGCTCAAACTCACTC
9305R-J6F       AATCTAGAAGGCGCGCTTCCGGCAATGGAGTGAGTTTGAGC
9310R-J6F       CGTCTCTAGAGGATAAATCCAGGAGGCGCGCTTCCGGC
9399S-J6F       TACTTTTTGTAGGGGTAGGCCTTTTCC
9464R-J6F       CGTCTCTAGAGTGTAGCTAATGTGTGCCGCTCTA
9470R(24)-J6    CTATGGAGTGTAGCTAATGTGTGC
J6-3'XR         CGTCTCTAGACATGATCTGCAGAGAGACCAGTTACX5GCACTCTCTGFCAGTCATGCGGC
                TCACGGACCTTTCACAGCTAGCCGTGACTAGGGCTAAGATGGAGCCACC

a HCV-specific sequences are shown in plain text, non HCV-specific
sequences are shown in bold face, and cleavage sites used for cDNA
cloning are underlined.

b The core sequence of the T7 promoter is shown in italics.
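
Because the bold and underlined formatting of Table 1 does not survive in plain text, restriction sites in a primer can be located by searching for the enzyme recognition sequences. The sketch below is illustrative only: the recognition sequences are the standard ones for the enzymes named in the cloning steps, the two primer sequences are copied from Table 1, and their pairing with the designations H2751SII and H7851S is an assumption based on the order of the table entries. The function name `find_sites` is hypothetical.

```python
# Standard recognition sequences for enzymes mentioned in the cloning steps.
# All are palindromic, so searching one strand is sufficient.
RECOGNITION_SITES = {
    "HindIII": "AAGCTT",
    "XbaI": "TCTAGA",
    "NdeI": "CATATG",
    "ClaI": "ATCGAT",
    "AflII": "CTTAAG",
    "StuI": "AGGCCT",
    "Eco47III": "AGCGCT",
}


def find_sites(sequence: str) -> dict:
    """Return {enzyme: [1-based start positions]} for sites found in sequence.

    Whitespace inside the sequence (an extraction artifact) is ignored.
    """
    seq = "".join(sequence.split()).upper()
    hits = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)
            start = seq.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    # Sequences as printed in Table 1; the designations are assumed pairings.
    primers = {
        "H2751SII": "CGTCATCGATACCTCAGCGGGCATATGCACTGGACACGGA",
        "H7851S": "TCCGTAGAGG AAGCTT GCAGCCTGACGCCC",
    }
    for name, seq in primers.items():
        print(name, find_sites(seq))
```

The first primer is reported to carry ClaI and NdeI sites and the second a HindIII site, which agrees with the enzymes named for these primers in the construction of the cassette vector.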






The strategy used to amplify and clone the
full-length HC-J6CH sequence is shown in Fig. 1.
Nucleotide positions correspond to those of the 2a
infectious clone (pJ6CF) that is described herein. The
5' end of HC-J6CH (nts. 17-297, excluding primer
sequences) was amplified from 2 µl of cDNA synthesized
with primer a-2 (Yanagi et al., 1996). PCR was performed
with AmpliTaq Gold DNA polymerase (Perkin-Elmer) as
described previously (Yanagi et al., 1996) using primers
1S-J6F and a-2. After purification, the amplified
products were cloned into pGEM-T Easy vector (Promega)
using standard procedures and 5 clones (pJ6-5'UTR) were
sequenced.

The 3' end of HC-J6CH was amplified in 3
overlapping pieces. RT-PCR of a short fragment of NS5B
(nts. 9279-9439) was performed with primers 9251S-J6F
and 9464R-J6F as described above. The PCR products were
cloned into pGEM-T Easy vector and sequence analysis was
performed on 5 pJ6-3'F clones. A second region
spanning from NS5B to the conserved region of the 3' UTR
(nts. 9376-9629) was amplified in RT-nested PCR
(external primers H9261F and H3'X58R, internal primers
H9282F and H3'X45R) (Yanagi et al., 1997). The amplified
products were cloned into pGEM-9Zf(-) by using HindIII
and XbaI sites and 14 pJ6-3'VR clones were sequenced.
The third fragment, which included the 3' terminal
sequence, was amplified with primers 9399S-J6F and
J6-3'XR from one of the pJ6-3'VR clones, and cloned into
one of the pJ6-3'F clones by using StuI and XbaI sites
(pJ6-3'X).

The ORF of HCV HC-J6CH was amplified by long
RT-PCR in 3 overlapping pieces. The amplification was
performed on 2 µl of the cDNA mixtures with the
Advantage cDNA polymerase mix (Clontech) (Yanagi et al.,
1997). The J6S fragment (nts. 86-2761) was amplified
with primers a-1 (Yanagi et al., 1996) and J6-2787R from
cDNA synthesized with primer J6-3329R. A single PCR
round was performed in a Robocycler thermal cycler
(Stratagene), and consisted of denaturation at 99°C for
35 sec, annealing at 67°C for 30 sec and elongation at
68°C for 4 min 30 sec during the first 5 cycles, 5 min
during the next 10 cycles, 5 min 30 sec during the
following 10 cycles and 6 min during the last 10 cycles.
The J6B fragment (nts. 2573-5488) was amplified with
primers 2543S-J6F and 5518R-J6F from cDNA synthesized
with primer 5518R-J6F. Finally, the J6A fragment (nts.
5515-9282) was amplified with primers 5487S-J6F and
9310R-J6F from cDNA synthesized with primer
9470R(24)-J6F. PCR amplifications of fragments J6B and
J6A consisted of denaturation at 99°C for 35 sec,
annealing at 67°C for 30 sec and elongation at 68°C for 6
min during the first 5 cycles, 7 min during the next 10
cycles, 8 min during the following 10 cycles and 9 min
during the last 10 cycles.
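
The two cycling programs above can be written down as data to check the cycle count and the programmed hold time. This is a minimal sketch: the step durations are taken from the text, while the class and function names are hypothetical, and the totals exclude ramp times and any initial or final steps, which the text does not describe.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    cycles: int
    elongation_s: int  # elongation time at 68°C, in seconds


# Each cycle: denaturation at 99°C, then annealing at 67°C, then elongation.
DENATURATION_S = 35
ANNEALING_S = 30

# J6S fragment: 4 min 30 s, 5 min, 5 min 30 s and 6 min elongation stages.
J6S_PROGRAM = [Stage(5, 270), Stage(10, 300), Stage(10, 330), Stage(10, 360)]
# J6B and J6A fragments: 6, 7, 8 and 9 min elongation stages.
J6B_J6A_PROGRAM = [Stage(5, 360), Stage(10, 420), Stage(10, 480), Stage(10, 540)]


def total_cycles(program) -> int:
    """Number of PCR cycles in the program."""
    return sum(stage.cycles for stage in program)


def programmed_hold_seconds(program) -> int:
    """Sum of all programmed hold times (no ramping) in seconds."""
    return sum(
        stage.cycles * (DENATURATION_S + ANNEALING_S + stage.elongation_s)
        for stage in program
    )


if __name__ == "__main__":
    for name, program in (("J6S", J6S_PROGRAM), ("J6B/J6A", J6B_J6A_PROGRAM)):
        seconds = programmed_hold_seconds(program)
        print(f"{name}: {total_cycles(program)} cycles, {seconds / 3600:.2f} h of holds")
```

Both programs run 35 cycles. The longer elongation steps used for J6B and J6A are consistent with those fragments being larger than J6S.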

After purification of the long PCR products
with QIAquick PCR purification kit (QIAGEN), A-tailing
reactions were performed with AmpliTaq DNA polymerase
(Perkin Elmer) at 72°C for 1 hour. The gel-purified
A-tailed PCR products were cloned into pCR2.1 vector
(Invitrogen) or pGEM-T Easy vector (Promega). DH5-alpha
competent cells (GIBCO BRL) were transformed,
selected on LB agar plates containing 100 µg/ml
ampicillin (SIGMA) and amplified in LB liquid cultures
at 30°C for 18 - 20 hrs (Yanagi et al., 1997). Midiprep
was performed using Wizard Plus Midipreps DNA
Purification System (Promega). Multiple clones of the
J6S, J6A and the J6B fragments were sequenced.

The consensus sequence of strain HC-J6CH (nts.
17-9629) was determined by direct sequencing of PCR
products (nts. 297-3004 and nts. 4893-5762) and by
sequence analysis of the TA clones (nts. 17-5488 and
nts. 5515-9629) (Fig. 1). Both strands of DNA were
sequenced in all cases. Analyses of genomic sequences,
including multiple sequence alignments and tree
analyses, were performed with GeneWorks (Oxford
Molecular Group) (Bukh et al., 1995).
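
Deriving a consensus from several sequenced clones can be illustrated with a simple majority rule applied to each aligned position. This sketch is not the GeneWorks procedure used here. It assumes the clone sequences are already aligned and of equal length, and it marks a position as N when the most frequent bases are tied. The function name and the example sequences are invented for illustration.

```python
from collections import Counter


def consensus(aligned_clones):
    """Majority-rule consensus of equal-length, pre-aligned sequences.

    A position where the two most frequent bases are tied is reported as 'N'.
    """
    if not aligned_clones:
        raise ValueError("at least one sequence is required")
    length = len(aligned_clones[0])
    if any(len(clone) != length for clone in aligned_clones):
        raise ValueError("all sequences must have the same aligned length")

    result = []
    for column in zip(*aligned_clones):
        counts = Counter(column).most_common()
        top_base, top_count = counts[0]
        tied = len(counts) > 1 and counts[1][1] == top_count
        result.append("N" if tied else top_base)
    return "".join(result)


if __name__ == "__main__":
    # Invented four-clone example; two clones carry a unique substitution.
    clones = ["ACGT", "ACGT", "ACTT", "AGGT"]
    print(consensus(clones))
```

With this rule, a substitution seen in only one clone does not alter the consensus, which is why sequencing several clones of each fragment allows an unambiguous consensus to be called.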

Construction of chimeric cDNA clones of genotypes 1a &
2a

Four full-length intertypic chimeric cDNA
clones were constructed (Figs. 4, 5A, 5B). In each clone
the C, E1 and E2 genes encoded the consensus amino acid
sequence of HC-J6CH. The p7 protein was encoded either by
the HC-J6CH or pCV-H77C consensus sequence, and the NS
proteins were all encoded by pCV-H77C genes. To
engineer these cDNA clones, an NdeI site from pCV-H77C
was first eliminated by a silent substitution (C to T)
at position 9158. In brief, two fragments were
amplified from pCV-H77C with primers H7851S and
H9173R(M) and with primers H9140S(M) and H9417R (Table
3), gel-purified and used for fusion PCR with primers
H7851S and H9417R. The fusion PCR products were cloned
into pCV-H77C by using HindIII and AflII sites. A new
artificial NdeI site was introduced by a silent
substitution (C to T) at position 2765. PCR products,
which were amplified from pCV-H77C with primer H2751SII
containing artificial ClaI and NdeI sites and primer
H2870R, were cloned into the modified pCV-H77C by using
ClaI and Eco47III sites. The final construct (pH77CV)
was used as a cassette vector to construct the
intertypic chimeric HCV cDNA clones.
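
A silent substitution that removes a restriction site must leave the encoded amino acids unchanged while destroying the recognition sequence. The sketch below checks both conditions using the standard genetic code. The nine-nucleotide example is invented to show a C to T change inside an NdeI site (CATATG) and is not the actual sequence around position 9158. All names are hypothetical.

```python
# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

NDEI_SITE = "CATATG"


def translate(in_frame_dna: str) -> str:
    """Translate an in-frame DNA sequence (length divisible by 3)."""
    if len(in_frame_dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(
        CODON_TABLE[in_frame_dna[i:i + 3]] for i in range(0, len(in_frame_dna), 3)
    )


def is_silent(original: str, mutant: str) -> bool:
    """True when both in-frame sequences encode the same amino acids."""
    return translate(original) == translate(mutant)


if __name__ == "__main__":
    # Invented in-frame example: GCC ATA TGT -> GCT ATA TGT (C to T, third base).
    original, mutant = "GCCATATGT", "GCTATATGT"
    print("silent:", is_silent(original, mutant))
    print("NdeI site before:", NDEI_SITE in original)
    print("NdeI site after:", NDEI_SITE in mutant)
```

The same check applies in the opposite direction when a silent substitution is used to introduce a new site, as with the artificial NdeI site at position 2765.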

The four chimeric cDNA clones were constructed
as follows. pH77CV-J6S (nucleotide sequence shown in
SEQ ID No:3 and amino acid sequence shown in SEQ ID
No:4): The AgeI/BsmI fragment of clone J6S2 and the
BsmI/NdeI fragment of clone J6S1 were cloned into
pH77CV by using AgeI and NdeI sites; pH77(p7)CV-J6S
(nucleotide sequence shown in SEQ ID No:5 and amino acid
sequence shown in SEQ ID No:6): A fragment of pH77CV-J6S
was replaced with a fragment amplified from pCV-H77C
with primers J6-H2556S and H2786R by using BsaBI and
NdeI sites; pH77-J6S (nucleotide sequence shown in SEQ ID
No:7 and amino acid sequence shown in SEQ ID No:8): A
fragment amplified from pCV-H77C with primers a-1
and 356RF-J6H77 and another fragment amplified from
pH77CV-J6S with primers 333S-J6 and 753R-J6 were
gel-purified and a fusion PCR was performed with primers
a-1 and 753R-J6. The AgeI/ClaI fragment of the
subcloned fusion PCR products and the ClaI/NdeI fragment
of pH77CV-J6S were cloned into pH77CV-J6S by using AgeI
and NdeI sites; pH77(p7)-J6S (nucleotide sequence shown
in SEQ ID No:9 and amino acid sequence shown in SEQ ID
No:10): The AgeI/ClaI fragment of pH77-J6S and the
ClaI/NdeI fragment of pH77(p7)CV-J6S were cloned into
pH77(p7)CV-J6S by using AgeI and NdeI sites.

Each intertypic chimeric cDNA clone was
retransformed to select a single clone, and large-scale
preparation of plasmid DNA was performed with a QIAGEN
plasmid Maxi kit as described previously (Yanagi et al.,
1997). Each of the four cDNA clones was completely
sequenced before inoculation. Each clone was
genetically stable since the digestion pattern was as
expected following retransformation and the complete
sequence was the expected one.

Construction of full-length cDNA clone HC-J6CH

An overview of the full-length HC-J6CH clone is
presented in Fig. 1. In the final construct pJ6CF,
which encodes the consensus polyprotein of HC-J6CH, an
XbaI site was eliminated by a silent substitution (A to
G) at position 5494. Digested fragments containing the
consensus sequence were purified from the appropriate
subclones and ligated using the sites indicated. The
full-length cDNA clone (pJ6CF) was retransformed to
select a single clone, and large-scale preparation of
plasmid DNA followed by the complete sequence analysis
was performed. Clone pJ6CF was genetically stable.

Intrahepatic transfection of chimpanzee with transcribed
RNA

In duplicate 100 µl reactions, RNA was
transcribed in vitro with T7 RNA polymerase (Promega)
from 10 µg of template plasmid linearized with XbaI
(Promega) as described previously (Yanagi et al., 1997).
The integrity of the RNA was checked by electrophoresis
through agarose gel stained with ethidium bromide
(Yanagi et al., 1997). Each transcription mixture was
diluted with 400 µl of ice-cold phosphate-buffered
saline without calcium or magnesium and then immediately
frozen on dry ice and stored at -80°C. Within 24 hours,
both transcription mixtures were injected into the same
chimpanzee by percutaneous intrahepatic injection guided
by ultrasound (Yanagi et al., 1998, 1999). If the
chimpanzee did not become infected, the same
transfection was repeated once. After two negative
results, the next clone was inoculated into the same
chimpanzee following the same protocol. Injections were
performed at weeks 0 and 2 with pH77CV-J6S, at weeks 5
and 8 with pH77(p7)CV-J6S, at weeks 14 and 16 with
pH77-J6S, at weeks 19 and 23 with pH77(p7)-J6S, at week
28 with pJ6CF, and finally at week 34 with pCV-H77C.
The chimpanzee was maintained under conditions that met
or exceeded all requirements for its use in an approved
facility.

Serum samples were collected weekly from the
chimpanzee and monitored for liver enzyme levels by
standard procedures, anti-HCV antibodies by the
second-generation ELISA (Abbott) and HCV RNA by a
sensitive RT-nested PCR assay with AmpliTaq Gold DNA
polymerase using primers from the 5' UTR (Yanagi et al.,
1996). Samples were scored as negative for HCV RNA if
two independent tests on 100 µl of serum were negative.
The genome equivalent (GE) titer of HCV in positive
samples was determined by RT-nested PCR on 10-fold
serial dilutions of the extracted RNA (Bukh et al.,
1998). The consensus sequence of the complete ORF from
the chimpanzee infected with RNA transcripts of pJ6CF
was determined by direct sequencing of overlapping PCR
products obtained by long RT-nested PCR as previously
described (Yanagi et al., 1997) with HC-J6 specific
primers. After the intrahepatic transfection with RNA
transcripts of pCV-H77C, we performed H77 (genotype
1a)-specific RT-nested PCR with primers 2427S-H77 and
2832R-H77 for the 1st round and with primers 2462S-H77
and 2645R-H77 for the 2nd round (Table 3). The
sensitivity of this assay was equivalent to that of the
assay using 5' UTR primers when testing serum containing
only H77, genotype 1a. The genome titer of genotype 1a
was determined by using this specific RT-nested PCR on
10-fold serial dilutions of the extracted RNA.
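
Titration by RT-nested PCR on 10-fold serial dilutions is an endpoint method: the titer is read from the highest dilution that still gives a positive result. The sketch below shows one common way of converting such results into a titer. It assumes that the endpoint dilution contains about one genome equivalent per reaction and that the serum volume represented in the undiluted reaction is known. Both assumptions, the function name and the example values are illustrative and are not taken from the cited protocol.

```python
def endpoint_titer_ge_per_ml(results, serum_ml_per_undiluted_reaction):
    """Estimate a genome equivalent (GE) titer from 10-fold dilution results.

    results: booleans ordered from undiluted (10^0) to the highest dilution,
             True meaning the RT-nested PCR was positive at that dilution.
    serum_ml_per_undiluted_reaction: serum volume (ml) represented by the RNA
             in the undiluted reaction (a hypothetical input).
    Returns None when no dilution is positive. If results are not monotonic,
    the highest positive dilution is used.
    """
    if serum_ml_per_undiluted_reaction <= 0:
        raise ValueError("serum volume must be positive")
    last_positive = None
    for exponent, positive in enumerate(results):
        if positive:
            last_positive = exponent
    if last_positive is None:
        return None
    # Assumes roughly one GE per reaction at the endpoint dilution.
    return 10 ** last_positive / serum_ml_per_undiluted_reaction


if __name__ == "__main__":
    # Invented example: positive through the 10^-2 dilution, with the
    # undiluted reaction representing 0.01 ml of serum.
    titer = endpoint_titer_ge_per_ml([True, True, True, False, False], 0.01)
    print(f"approximately {titer:.0f} GE/ml")
```

Because the dilution steps are 10-fold, a titer obtained this way is resolved only to the nearest order of magnitude.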

EXAMPLE 1 

Sequence analysis of HCV strain HC-J6CH

As minor deviations from the consensus amino
acid sequence were found previously to render
full-length HCV cDNA clones noninfectious (Yanagi et
al., 1997, 1998), the consensus sequence of the cloning
source of genotype 2a (strain HC-J6CH) was determined
prior to constructing any full-length clones. In brief,
a plasma pool containing strain HC-J6CH was prepared from
acute phase plasmapheresis units collected from a
chimpanzee experimentally infected with HC-J6 (Okamoto
et al., 1991). The HCV genome titer of this pool was
10^5.4 genome equivalents (GE)/ml (Quantiplex HCV RNA
bDNA 2.0, Chiron) and the infectivity titer was 10^4
chimpanzee infectious doses/ml.
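
The two titers of the plasma pool can be related by simple arithmetic. The following sketch assumes the values read as 10^5.4 GE/ml and 10^4 chimpanzee infectious doses/ml. That reading of the exponents, and the helper name, are assumptions made for illustration.

```python
def ge_per_infectious_dose(genome_titer_log10: float, infectivity_titer_log10: float) -> float:
    """Genome equivalents per infectious dose, from log10 titers per ml."""
    return 10 ** (genome_titer_log10 - infectivity_titer_log10)


if __name__ == "__main__":
    genome_log10 = 5.4       # log10 GE/ml (assumed reading of the text)
    infectivity_log10 = 4.0  # log10 chimpanzee infectious doses/ml
    print(f"genome titer: {10 ** genome_log10:.0f} GE/ml")
    ratio = ge_per_infectious_dose(genome_log10, infectivity_log10)
    print(f"about {ratio:.0f} GE per chimpanzee infectious dose")
```

Under these assumptions the pool contains on the order of 25 genome equivalents for each chimpanzee infectious dose.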

The consensus sequence of the 5' UTR of HC-J6CH
(nts. 17-340) was deduced from 5 clones containing nts.
17-297 and 8 clones containing nts. 86-340. The 5' UTR
of the various clones was highly conserved, but the
consensus sequence of HC-J6CH differed by 2 nucleotides
from that published previously for HC-J6 (Okamoto et
al., 1991: C to T at position 36 and T to C at position
222).

The consensus sequence of 14 clones of the 3'
UTR of HC-J6CH indicated that the 39 nucleotide long
variable region was highly conserved in this strain and
was identical to that previously published for HC-J6
(Okamoto et al., 1991). The polypyrimidine tract varied
greatly in length (84-164 nucleotides), and contained
some conserved A residues. In the conserved region, the
proximal 16 nucleotides were identical to those
previously published for isolates of different HCV
genotypes (Kolykhalov et al., 1996; Tanaka et al., 1996;
Yamada et al., 1996). The remaining 82 nucleotides of
the conserved region were determined for other genotype
2a strains (Tanaka et al., 1996) but not for HC-J6 or
HC-J6CH.

The ORF of HC-J6CH was amplified in 3 fragments
by RT-PCR (Fig. 1). Eight clones of the J6S fragment
(nts. 86-2761), 6 clones of the J6B fragment (nts.
2573-5488) and 6 clones of the J6A fragment (nts.
5515-9298) were sequenced. PCR fragments containing
nts. 5489-5514 were sequenced directly. A quasispecies
was found at 243 nucleotide (2.7%) and 69 amino acid
(2.3%) positions, scattered throughout the 9099 nts
(3033 aa) of the ORF. However, the majority, 231
nucleotide substitutions, were detected only once and
71.6% of these represented silent mutations. The 12
remaining nucleotide substitutions were each restricted
to 2 clones and only 4 of these resulted in amino acid
changes. The nucleotide difference among the J6S clones
ranged from 0.1 - 1.3%, among the J6B clones it ranged
from 0.1 - 0.3%, and it ranged from 0.2 - 4.0% among the
J6A clones (Fig. 2). Three of 8 J6S clones, 4 of 6 J6B
clones, and all 6 J6A clones had defective polyproteins
due to nucleotide deletions, insertions or
substitutions.

The sequences of clones of strain HC-J6CH were
relatively homogeneous. This was highlighted by the
high degree of conservation among clones of the HVR1
(Fig. 3), a region frequently used to study the
quasispecies of HCV (Bukh et al., 1995). An exception
was the sequence of clone J6A1, which differed by about
4% from the other clones of this region (Fig. 2).
Importantly, the consensus sequence of strain HC-J6CH
(nts. 17-9629) could be determined with no ambiguity at
the nucleotide or deduced amino acid level. The
difference between the consensus ORF sequence of HC-J6CH
from the experimentally infected chimpanzee and that of
HC-J6 of the inoculum (Okamoto et al., 1991) was 4.1%
and 2.2% at the nucleotide and deduced amino acid
levels, respectively (Fig. 2, Table 2). Moreover, we
found that 12 (44.4%) of the 27 amino acids constituting
HVR1 differed between HC-J6CH and HC-J6 (Fig. 3). Such
diversities are greater than the < 2% generally
considered to comprise a quasispecies. In fact, these
differences are equivalent to those found between the
two prototype strains of HCV genotype 1a [strains HCV-1
(Choo et al., 1991) and H77 (Yanagi et al., 1997)].
These results indicated that HC-J6CH, which represented
the major species in the experimentally infected
chimpanzee, was a minor species in the original
inoculum.

TABLE 2

Percent difference of nucleotide and predicted amino acid sequences
between strain HC-J6 (Okamoto et al., 1991) and strain HC-J6CH from
the experimentally infected chimpanzee

Genome Region   nt. position^a   % nt. difference     % a.a. difference

ORF             341-9439          4.1 (373/9099)^b     2.2 (66/3033)^b
5' UTR          17-340            0.6 (2/324)
Core            341-913           0.5 (3/573)          0 (0/191)
E1              914-1489          4.3 (25/576)         2.1 (4/192)
HVR1            1490-1570        24.7 (20/81)         44.4 (12/27)
E2-HVR1         1571-2590         3.9 (40/1020)        3.2 (11/340)
p7              2591-2779         3.7 (7/189)          3.2 (2/63)
NS2             2780-3430         4.0 (26/651)         2.8 (6/217)
NS3             3431-5323         4.0 (76/1893)        0.8 (5/631)
NS4A            5324-5485         4.3 (7/162)          1.9 (1/54)
NS4B            5486-6268         3.7 (29/783)         0.4 (1/261)
NS5A            6269-7666         5.4 (75/1398)        3.4 (16/466)
NS5B            7667-9439         3.7 (65/1773)        1.4 (8/591)
3' UTR          9440-9481         0 (0/42)

a The nucleotide positions correspond to those of the infectious
full-length genotype 2a clone (pJ6CF).
b The numbers in parentheses indicate the nucleotide or amino acid
differences for each region.
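
The percentages in Table 2 follow directly from the difference counts shown in parentheses. As a cross-check, the short Python sketch below recomputes each percentage from its count and region length, derives each region length from the nucleotide positions, and confirms that the per-region counts add up to the ORF totals. The variable and function names are illustrative only, and the row covering nts 2591-2779 is assumed to be p7, in line with the genome regions listed in Table 3.

```python
# Rows of Table 2 covering the ORF: (region, first nt, last nt,
# nt differences, aa differences). Values are copied from the table;
# the name "p7" for nts 2591-2779 is an assumption (see lead-in).
ORF_REGIONS = [
    ("Core",     341,  913,  3,  0),
    ("E1",       914, 1489, 25,  4),
    ("HVR1",    1490, 1570, 20, 12),
    ("E2-HVR1", 1571, 2590, 40, 11),
    ("p7",      2591, 2779,  7,  2),
    ("NS2",     2780, 3430, 26,  6),
    ("NS3",     3431, 5323, 76,  5),
    ("NS4A",    5324, 5485,  7,  1),
    ("NS4B",    5486, 6268, 29,  1),
    ("NS5A",    6269, 7666, 75, 16),
    ("NS5B",    7667, 9439, 65,  8),
]

def percent(differences, length):
    """Percent difference rounded to one decimal, as reported in the table."""
    return round(100.0 * differences / length, 1)

def summarize(regions):
    """Return per-region rows plus nt length and nt/aa difference totals."""
    rows = []
    total_nt = total_nt_diff = total_aa_diff = 0
    for name, first, last, nt_diff, aa_diff in regions:
        nt_len = last - first + 1   # positions are inclusive
        aa_len = nt_len // 3        # three nucleotides per codon
        rows.append((name, nt_len, percent(nt_diff, nt_len),
                     aa_len, percent(aa_diff, aa_len)))
        total_nt += nt_len
        total_nt_diff += nt_diff
        total_aa_diff += aa_diff
    return rows, total_nt, total_nt_diff, total_aa_diff

rows, orf_nt, orf_nt_diff, orf_aa_diff = summarize(ORF_REGIONS)
for name, nt_len, nt_pct, aa_len, aa_pct in rows:
    print(f"{name:8s} {nt_len:5d} nt {nt_pct:5.1f}%  {aa_len:4d} aa {aa_pct:5.1f}%")
print("ORF", orf_nt, percent(orf_nt_diff, orf_nt),
      percent(orf_aa_diff, orf_nt // 3))
```

Summing the rows this way gives 373 nucleotide and 66 amino acid differences over 9099 nts (3033 aa), matching the ORF row of the table.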

Example 2 

Chimeric molecular clones 

As chimeric flaviviruses with substituted
structural genes have been useful in defining the
biological function of viral sequences or proteins, in
analyzing immune responses and in generating attenuated
vaccine candidates (Bray and Lai, 1991; Chambers et al.,
1999; Pletnev et al., 1992, 1993, 1998), the consensus
sequence of the 2a structural genes and surrounding
region was substituted for that of the infectious 1a
cDNA clone. In the genotype 1a backbone, two silent
mutations were introduced for cloning purposes [at
positions 2765 (p7) and 9158 (NS5B) of pCV-H77C] (Fig.
4). The complete sequence of each chimera was verified.
Infectivity of RNA transcripts from four different
intertypic chimeric clones (Figs. 4, 5A, 5B) was evaluated
by consecutive intrahepatic transfections of a chimpanzee.
Clones were considered not to be viable if viral RNA was not
detected in the serum within two weeks of the repeat
transfection. All chimeric clones contained the C, E1 and E2
genes of genotype 2a. The two chimeric clones tested
initially differed from each other in that one had the p7
gene of 2a (pH77CV-J6S) and the other [pH77(p7)CV-J6S] the
p7 gene of 1a. They differed from the two other clones in
that the 186 nucleotides of the 5' UTR just upstream of the
initiation codon were from the 2a genotype. Since neither
clone containing the chimeric 5' UTR was infectious, the
chimeric 5' UTR was replaced with the consensus genotype 1a
5' UTR to generate the two p7 varieties [pH77-J6S and
pH77(p7)-J6S]. After consecutive transfection of the four
clones, no HCV RNA, anti-HCV or ALT elevation was detected
in the chimpanzee during 28 weeks of follow-up, suggesting
that RNA transcripts from these intertypic chimeric clones
were not viable in vivo.

This finding that the intertypic clones between
genotypes 1a and 2a were not viable was surprising since
flavivirus chimeras containing the structural region of
dengue virus type 1 or 2 or of tick-borne encephalitis virus
and the nonstructural region of an infectious dengue type 4
virus were viable (Bray and Lai, 1991; Pletnev et al., 1992,
1993). While considerable sequence variation exists between
the infectious genotype 1a and 2a clones of HCV (Table 3),
these flaviviruses exhibit a higher degree of genetic
heterogeneity than do the major genotypes of HCV. For other
flaviviruses, however, it was possible to obtain
infectious chimeric clones only if the capsid region was
derived from the backbone cDNA clone (Chambers et al.,
1999; Pletnev and Men, 1998).



TABLE 3

Percent difference of the amino acid sequences between
the infectious clone of genotype 1a (pCV-H77C;
Yanagi et al., 1997) and the infectious clone of
genotype 2a (pJ6CF)

Genome Region^a   % difference

Polyprotein       27.9 (839/3007)^b
Core               8.9 (17/191)
E1                37.0 (71/192)
HVR1              59.3 (16/27)
E2-HVR1           27.1 (91/336)
p7                38.1 (24/63)
NS2               41.9 (91/217)
NS3               19.2 (121/631)
NS4A              33.3 (18/54)
NS4B              26.8 (70/261)
NS5A              38.5 (171/444)
NS5B              25.2 (149/591)

a Genome regions defined as in Table 1.
b The numbers in parentheses indicate the amino
acid differences for each region.
Positions with deletions or insertions in E2 (4
aa positions) and NS5A (26 aa positions) were
not considered.

Trivial explanations may account for the lack
of viability of these intertypic chimeras. First, the
two silent mutations introduced in the genotype 1a
backbone (one in p7 and one in NS5B) for cloning
purposes could potentially eliminate infectivity. This
is, however, very unlikely since mutations at these
positions exist among field isolates of HCV including
strain HC-J6CH (Bukh et al., 1998). Also, it is
noteworthy that the three previously published
infectious clones of strain H77 had numerous silent
nucleotide differences (Hong et al., 1999; Kolykhalov et
al., 1997; Yanagi et al., 1997). Second, signal
peptidases might not cleave the chimeric E2/p7 or p7/NS2
junction. This seems unlikely, however, since
eukaryotic signal peptidases typically recognize the
amino acid sequences upstream of the cleavage site [the
(-3, -1) rule] (Nielsen et al., 1997) and the amino
acids at these two sites are conserved between genotypes
1a and 2a (Fig. 5B). Finally, the E2/p7 and/or p7/NS2
gene junctions could differ between genotypes 1a and 2a.
The junctions determined for genotypes 1a and 1b were
used (Lin et al., 1994; Mizushima et al., 1994; Selby et
al., 1994) because those for genotype 2a have not been
identified. In the latter two cases, further analyses
of genotype 2a should eventually provide sufficient data
to overcome such potential problems and it would most
likely be possible to construct a viable chimera.
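
The (-3, -1) rule mentioned above can be illustrated with a minimal Python sketch. It inspects the residues at positions -3 and -1 relative to a proposed cleavage site and reports whether both are small, uncharged residues. The sets of permitted residues are an assumption based on the commonly cited form of the rule, the peptide strings are arbitrary test inputs rather than HCV sequences, and a check of this kind is only a rough screen, not a cleavage predictor.

```python
# Residues treated as acceptable at each position. These sets are an
# assumption for illustration (small, uncharged side chains); published
# formulations of the (-3, -1) rule differ in detail.
ALLOWED_AT_MINUS_1 = set("AGSCT")
ALLOWED_AT_MINUS_3 = set("AGSCTVIL")

def satisfies_minus3_minus1_rule(peptide, cleavage_index):
    """Check the residues at -3 and -1 relative to a cleavage site.

    peptide is a one-letter amino acid string and cleavage_index is the
    index of the first residue after the cut, so peptide[cleavage_index - 1]
    is position -1 and peptide[cleavage_index - 3] is position -3.
    """
    if cleavage_index < 3 or cleavage_index > len(peptide):
        raise ValueError("need at least three residues upstream of the site")
    minus_1 = peptide[cleavage_index - 1]
    minus_3 = peptide[cleavage_index - 3]
    return minus_1 in ALLOWED_AT_MINUS_1 and minus_3 in ALLOWED_AT_MINUS_3

# Arbitrary example peptides, cut after the sixth residue (index 6).
print(satisfies_minus3_minus1_rule("MKTLVAGSQ", 6))  # L at -3, A at -1
print(satisfies_minus3_minus1_rule("MKTDVKGSQ", 6))  # D at -3, K at -1
```

Applied to a chimeric junction, the same check would be run on the residues immediately upstream of the E2/p7 and p7/NS2 cleavage sites of each parental genotype to confirm that the chimera preserves them.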

More complicated explanations for the lack of
viability of the chimeras might be required if critical
genotype-specific interactions occur as regards the
structural proteins, the nonstructural proteins and the
genomic RNA. For instance, one cannot rule out that the
chimeras were not viable because the IRES function was
compromised. In in vitro studies the IRES activity
depended on RNA sequences not only in the 5' UTR but
also extending 3' of the translation initiation site
(Hahm et al., 1998; Lemon and Honda, 1997; Reynolds et
al., 1995). Although the 3' border of the HCV IRES is
still controversial, it is believed to involve at most
the first 39 nts of the core gene (Lemon and Honda,
1997). The 5' UTR of the intertypic chimeras was either
a chimera of genotype 1a and 2a sequences or the entire
5' UTR was derived from the 1a clone (Figs. 4, 5A).
Importantly, the 5' end of core is conserved among
genotypes 1a and 2a (Fig. 5A). Thus, the predicted
IRES-like secondary structure is maintained in these
chimeras, suggesting that the IRES activity most likely
was maintained.

Possible interactions between the structural
proteins and the nonstructural proteins and/or the
genomic RNA, which involve RNA packaging, replication or
translation, are conceivable. In poliovirus, which is
another positive-sense RNA virus, functional coupling of
RNA packaging to RNA replication and of RNA replication
to translation has been suggested (Novak and
Kirkegaard, 1994; Nugent et al., 1999). Similar to
other viruses of the Flaviviridae family, a
membrane-associated replicase complex is thought to initiate
replication at the 3' end of HCV and to synthesize a
complementary negative-strand RNA (Rice, 1996). The
putative cis-acting elements at the 5' and 3' termini
which are believed to be important for viral genome
replication (Rice, 1996; Frolov et al., 1998) should be
maintained in the intertypic HCV chimeras at least in
the two constructs with the authentic 1a 5' UTR.
However, it is conceivable that the viral packaging
system was interrupted (Frolov et al., 1998). Studies
using a Kunjin flavivirus replicon system and providing
the structural proteins in trans suggested that the
essential encapsidation signals did not reside in the
structural region of the genome (Khromykh et al., 1997,
1998). The location of the packaging signals of HCV is
not known. However, if the structural proteins
encapsidate viral RNA via genotype-specific sequences
outside of the structural region, the chimeras would be
unable to package the RNA and it might be extremely
difficult to construct viable chimeras between highly
divergent strains.

Example 3 

A consensus molecular clone of 
genotype 2a is infectious in vivo 

In order to prove that the genotype 2a portion
used in the 4 intertypic chimeric cDNA clones indeed
represented the infectious sequence, a consensus
full-length cDNA clone of HC-J6CH (pJ6CF) was constructed.
The core sequence of the T7 promoter, a 5' guanosine
residue and the full-length sequence of HC-J6CH (9711
nts) were cloned into pGEM-9Zf vector using NotI/XbaI
sites. Within the HCV sequence there were no deduced
amino acid differences and only 4 nucleotide differences
(at nucleotide positions 1822, 5494, 9247 and 9289) from
the consensus sequence of HC-J6CH as determined in the
present study. The silent mutation at position 1822 was
within the structural region and so was also present in
the four intertypic chimeras. The 5' terminal 16 nts
and the 3' terminal 82 nts were deduced from previously
published HCV genotype 2a sequences (Okamoto et al.,
1991; Tanaka et al., 1996). The full-length cDNA clone
of genotype 2a contained a 5' UTR of 340 nts, an ORF of
9099 nts encoding 3033 amino acids and a 3' UTR
consisting of a variable region of 39 nts followed by a
132 nucleotide-long polypyrimidine tract interrupted
with 3 A residues and the 3' terminal conserved region
of 98 nts.

RNA transcripts from pJ6CF were injected into
the same chimpanzee used for injection of the 4
intertypic chimeras. The chimpanzee became infected at
the first attempt with an HCV titer of 10^2 GE/ml at week
1 post inoculation (p.i.), and 10^3-10^4 GE/ml during weeks
2 to 6 p.i. The consensus sequence of PCR products of
the complete ORF, amplified from serum obtained during
week 5 p.i., was identical to the sequence of pJ6CF and
there was no evidence of a quasispecies. Since RNA
transcripts of this infectious genotype 2a clone were
infectious in vivo, and it shared an exact sequence with
the non-infectious intertypic chimeric clones, their
failure to replicate must have been the result of
incompatibilities between the genotype 1a and 2a
sequences.

To confirm that the chimpanzee used was
susceptible also to infection by genotype 1a, which
comprised most of the intertypic chimeras, the
chimpanzee was subsequently inoculated with RNA
transcripts from the infectious genotype 1a clone
(pCV-H77C). Serum samples were tested in an
H77-specific RT-PCR assay to identify super-infection
with genotype 1a. At week 1 p.i. the total HCV genome
titer was 10^4 GE/ml and the H77-specific (1a) genome
titer was 10^2 GE/ml. The H77-specific genome titer
increased to 10^3 GE/ml at week 2 p.i., and reached 10^4
GE/ml during weeks 3-6 p.i. The consensus sequence of
PCR products amplified with H77-specific primers at
weeks 1-6 p.i. was found to be identical to that of
pCV-H77C. However, the direct sequences of PCR products
amplified with the 5' UTR primers at weeks 1-2 after
inoculation of pCV-H77C were identical to that of pJ6CF,
indicating that the 2a genotype was still present and
represented the majority species. These experiments
confirmed that the inability of the intertypic 1a/2a
cDNA clones to infect the chimpanzee was not the result
of protective immune responses in the chimpanzee but
represented deficiencies intrinsic to the chimeras.

Discussion 

The published infectious cDNA clones of HCV
represent the two most important subtypes of genotype 1
(Hong et al., 1999; Kolykhalov et al., 1997; Yanagi et
al., 1997, 1998). However, 5 more major genotypes of
HCV are recognized. In the above Examples, the
infectivity of a cDNA clone of a second major HCV
genotype was demonstrated. As in previous studies, the
infectivity of RNA transcripts was demonstrated in vivo
by intrahepatic transfection of a chimpanzee. This new
infectious clone (pJ6CF) encodes the consensus
polyprotein of HCV strain HC-J6CH, genotype 2a. Its
encoded polyprotein differs from those of the infectious
clones of genotypes 1a and 1b by approximately 30%
(Table 3). Genotype 2 strains, in particular subtypes
2a and 2b, have a worldwide distribution and important
differences between genotypes 1 and 2 with respect to
pathogenesis and treatment were indicated in previous
studies. The availability of an infectious clone
representing a second major genotype of HCV should
permit new ways of studying the molecular biology and
immunopathology of this important and genetically quite
different human pathogen.

The 5' and 3' UTRs of HCV are believed to be
critical for viral replication, translation and viral
packaging (Rice, 1996). The 5' terminal 203 nucleotides
and the 3' terminal 101 nucleotides of the published
infectious clones of genotypes 1a and 1b were identical.
However, the sequences of UTRs of the genotype 2a clone
differ from those of the genotype 1 clones. Overall,
the 5' UTR of the genotype 2a clone has 17 nt
differences and a single nucleotide deletion compared
with the infectious clones of genotype 1a (Fig. 5A).
Five of these differences and the deletion are within
the first 30 nucleotides, whereas the remainder are
found within the predicted IRES structure. Differences
also exist between the 3' UTR of the genotype 2a clone
and the clones of genotype 1a (Fig. 5B). The sequences
of the variable region are very different. A recent study
has shown that this region is not critical for infectivity
in vivo (Yanagi et al., 1999). Within the regions which
are critical for infectivity in vivo (Yanagi et al.,
1999), the 132 nucleotide-long polypyrimidine tract of
the genotype 2a clone has 3 unique A residues
interspersed and the 3' terminal conserved region of 98
nts has 4 nt differences within the 3' terminal stable
stem-loop structure (Fig. 5B) (Kolykhalov et al., 1996;
Tanaka et al., 1996). Since the 2a clone was infectious,
these sequence differences are apparently real and are
compatible with infectivity. Further studies are
required to determine whether these represent critical
genotype-specific sequences.






WO 00/75338 



PCT/US00/15446 






References 



1. Alter, M. J. (1997). Hepatology 26, 62S-65S. 

2. Blight, K. J. and Rice, C. M. (1997). J. Virol. 
71, 7345-7352. 

3. Brechot, C. (1997). Hepatology 25, 772-774.

4. Bray, M. and Lai, C.-J. (1991). Construction of
intertypic chimeric dengue viruses by substitution of
structural protein genes. Proc. Natl. Acad. Sci. USA
88, 10342-10346.

5. Bukh, J., Apgar, C. L., Engle, R., Govindarajan,
S., Hegerich, P. A., Tellier, R., Wong, D. C., Elkins,
R. and Kew, M. C. (1998). Experimental infection of
chimpanzees with hepatitis C virus of genotype 5a:
genetic analysis of the virus and generation of a
standardized challenge pool. J. Infect. Dis. 178,
1193-1197.

6. Bukh, J., Emerson, S. U. and Purcell, R. H.
(1997). Genetic heterogeneity of hepatitis C virus and
related viruses. In "Viral Hepatitis and Liver Disease,
Proceedings of IX Triennial International Symposium on
Viral Hepatitis and Liver Disease, Rome, Italy, 1996"
(M. Rizzetto, R. H. Purcell, J. L. Gerin and G. Verme,
Eds.), pp. 167-175. Edizioni Minerva Medica, Turin.

7. Bukh, J., Miller, R. H. and Purcell, R. H. (1995).
Genetic heterogeneity of hepatitis C virus: quasispecies
and genotypes. Semin. Liver Dis. 15, 41-63.

8. Bukh, J., Purcell, R. H. and Miller, R. H. (1993).
At least 12 genotypes of hepatitis C virus predicted by
sequence analysis of the putative E1 gene of isolates
collected worldwide. Proc. Natl. Acad. Sci. USA 90,
8234-8238.

9. Choo, Q.-L., Richman, K. H., Han, J. H., Berger,
K., Lee, C., Dong, C., Gallegos, C., Coit, D.,
Medina-Selby, A., Barr, P. J., Weiner, A. J., Bradley, D. W.,
Kuo, G. and Houghton, M. (1991). Genetic organization
and diversity of the hepatitis C virus. Proc. Natl.
Acad. Sci. USA 88, 2451-2455.

10. Chambers, T. J., Nestorowicz, A., Mason, P. W. and
Rice, C. M. (1999). Yellow Fever/Japanese Encephalitis
Chimeric Viruses: Construction and Biological
Properties. J. Virol. 73, 3095-3101.



11. Dash, S., et al. (1997). Am. J. Pathol. 151,
363-373.

12. Davis, G. L., Esteban-Mur, R., Rustgi, V., Hoefs,
J., Gordon, S. C., Trepo, C., Shiffman, M. L., Zeuzem,
S., Craxi, A., Ling, M.-H. and Albrecht, J., for the
International Hepatitis Interventional Therapy Group
(1998). Interferon alfa-2b alone or in combination with
ribavirin for the treatment of relapse of chronic
hepatitis C. N. Engl. J. Med. 339, 1493-1499.

13. Fausto, N. (1997). Am. J. Pathol. 151, 361.

14. Forns, X., Bukh, J., Purcell, R. H., Emerson, S. U.
(1997). How Escherichia coli can bias the results of
molecular cloning: preferential selection of defective
genomes of hepatitis C virus during the cloning
procedure. Proc. Natl. Acad. Sci. USA 94, 13909-13914.

15. Forns, X. and Bukh, J. (1998). Methods for
determining the hepatitis C virus genotype. Viral
Hepatitis Reviews 4, 1-19.

16. Fried, M. W. and Hoofnagle, J. H. (1995). Semin.
Liver Dis. 15, 82-91.

17. Frolov, I., McBride, M. S. and Rice, C. M. (1998).
Cis-acting RNA elements required for replication of
bovine viral diarrhea virus-hepatitis C virus 5'
nontranslated region chimeras. RNA 4, 1418-1435.

18. Hahm, B., Kim, Y. K., Kim, J. H., Kim, T. Y. and
Jang, S. K. (1998). Heterogeneous nuclear
ribonucleoprotein L interacts with the 3' border of the
internal ribosomal entry site of hepatitis C virus. J.
Virol. 72, 8782-8788.

19. Hijikata, M., Kato, N., Ootsuyama, Y., Nakagawa,
M., Ohkoshi, S. and Shimotohno, K. (1991). Hypervariable
regions in the putative glycoprotein of hepatitis C
virus. Biochem. Biophys. Res. Commun. 175, 220-228.

20. Honda, M., et al. (1996). RNA 2, 955-968.

21. Hong, Z., Beaudet-Miller, M., Lanford, R. E.,
Guerra, B., Wright-Minogue, J., Skelton, A., Baroudy, B.
M., Reyes, G. R. and Lau, J. Y. N. (1999). Generation
of transmissible hepatitis C virions from a molecular
clone in chimpanzees. Virology 256, 36-44.

22. Hoofnagle, J. H. (1997). Hepatitis C: the clinical 
spectrum of disease. Hepatology 26, 15S-20S. 


23. Houghton, M. (1996). Hepatitis C viruses. In
"Fields Virology" (B. N. Fields, D. M. Knipe, P. M.
Howley, et al., Eds.), Third ed., pp. 1035-1058.
Lippincott-Raven Publishers, Philadelphia.

24. Khromykh, A. A. and Westaway, E. G. (1997).
Subgenomic replicons of the flavivirus Kunjin:
construction and applications. J. Virol. 71, 1497-1505.

25. Ito, T. and Lai, M. M. C. (1997). J. Virol. 71,
8698-8706.

26. Khromykh, A. A., Varnavski, A. N. and Westaway, E.
G. (1998). Encapsidation of the flavivirus Kunjin
replicon RNA by using a complementation system providing
Kunjin virus structural proteins in trans. J. Virol.
72, 5967-5977.

27. Kolykhalov, A. A., Feinstone, S. M. and Rice, C. M.
(1996). Identification of a highly conserved sequence
element at the 3' terminus of hepatitis C virus genome
RNA. J. Virol. 70, 3363-3371.

28. Kolykhalov, A. A., Agapov, E. V., Blight, K. J.,
Mihalik, K., Feinstone, S. M. and Rice, C. M. (1997).
Transmission of hepatitis C by intrahepatic inoculation
with transcribed RNA. Science 277, 570-574.

29. Lemon, S. M. and Honda, M. (1997). Internal
ribosome entry sites within the RNA genomes of hepatitis
C virus and other flaviviruses. Semin. Virol. 8,
274-288.

30. Lin, C., Lindenbach, B. D., Pragai, B. M., McCourt,
D. W. and Rice, C. M. (1994). Processing in the
hepatitis C virus E2-NS2 region: identification of p7
and two distinct E2-specific products with different C
termini. J. Virol. 68, 5063-5073.

31. Lu, H.-H. and Wimmer, E. (1996). Poliovirus
chimeras replicating under the translational control of
genetic elements of hepatitis C virus reveal unusual
properties of the internal ribosomal entry site of
hepatitis C virus. Proc. Natl. Acad. Sci. USA 93,
1412-1417.

32. McHutchison, J. G., Gordon, S. C., Schiff, E. R.,
Shiffman, M. L., Lee, W. M., Rustgi, V. K., Goodman, Z.
D., Ling, M.-H., Cort, S. and Albrecht, J. K., for the
Hepatitis Interventional Therapy Group (1998).
Interferon alfa-2b alone or in combination with
ribavirin as initial treatment for chronic hepatitis C.
N. Engl. J. Med. 339, 1485-1492.




33. Mizushima, H., Hijikata, M., Asabe, S.-I., Hirota,
M., Kimura, K. and Shimotohno, K. (1994). Two hepatitis
C virus glycoprotein E2 products with different C
termini. J. Virol. 68, 6215-6222.

34. Nakao, H., Okamoto, H., Tokita, H., Inoue, T.,
Iizuka, H., Pozzato, G. and Mishiro, S. (1996).
Full-length genomic sequence of a hepatitis C virus
genotype 2c isolate (BEBE1) and the 2c-specific PCR
primers. Arch. Virol. 141, 701-704.

35. Nielsen, H., Engelbrecht, J., Brunak, S. and von
Heijne, G. (1997). Identification of prokaryotic and
eukaryotic signal peptides and prediction of their
cleavage sites. Protein Eng. 10, 1-6.

36. Novak, J. E. and Kirkegaard, K. (1994). Coupling
between genome translation and replication in an RNA
virus. Genes Dev. 8, 1726-1737.

37. Nugent, C. I., Johnson, K. L., Sarnow, P. and
Kirkegaard, K. (1999). Functional coupling between
replication and packaging of poliovirus replicon RNA.
J. Virol. 73, 427-435.

38. Okamoto, H., Kurai, K., Okada, S. I., Yamamoto, K.,
Iizuka, H., Tanaka, T., Fukuda, S., Tsuda, F. and
Mishiro, S. (1992). Full-length sequence of hepatitis
C virus genome having poor homology to reported
isolates: comparative study of four distinct genotypes.
Virology 188, 331-341.

39. Okamoto, H., Okada, S., Sugiyama, Y., Kurai, K.,
Iizuka, H., Machida, A., Miyakawa, Y. and
Mayumi, M. (1991). Nucleotide sequence of the
genomic RNA of hepatitis C virus isolated from a human
carrier: comparison with reported isolates for conserved
and divergent regions. J. Gen. Virol. 72, 2697-2704.

41. Pletnev, A. G., Bray, M., Huggins, J. and Lai,
C.-J. (1992). Construction and characterization of
chimeric tick-borne encephalitis/dengue type 4 viruses.
Proc. Natl. Acad. Sci. USA 89, 10532-10536.

42. Pletnev, A. G., Bray, M. and Lai, C.-J. (1993).
Chimeric tick-borne encephalitis and dengue type 4
viruses: Effects of mutations on neurovirulence in mice.
J. Virol. 67, 4956-4963.

43. Pletnev, A. G. and Men, R. (1998). Attenuation of
the Langat tick-borne flavivirus by chimerization with
mosquito-borne flavivirus dengue type 4. Proc. Natl.
Acad. Sci. USA 95, 1746-1751.




44. Reynolds, J. E., Kaminski, A., Kettinen, H. J.,
Grace, K., Clarke, B. E., Carroll, A. R., Rowlands, D.
J. and Jackson, R. J. (1995). Unique features of
internal initiation of hepatitis C virus RNA
translation. EMBO J. 14, 6010-6020.

45. Rice, C. M. (1996). Flaviviridae: The viruses and
their replication. In "Fields Virology" (B. N.
Fields, D. M. Knipe, P. M. Howley, et al., Eds.), Third
ed., pp. 931-959. Lippincott-Raven Publishers,
Philadelphia.

46. Robertson, B., Myers, G., Howard, C., Brettin, T.,
Bukh, J., Gaschen, B., Gojobori, T., Maertens, G.,
Mizokami, M., Nainan, O., Netesov, S., Nishioka, K.,
Shin-i, T., Simmonds, P., Smith, D., Stuyver, L. and
Weiner, A. (1998). Classification, nomenclature, and
database development for hepatitis C virus (HCV) and
related viruses: proposals for standardization. Arch.
Virol. 143, 2493-2503.

47. Selby, M. J., Glazer, E., Masiarz, F. and Houghton,
M. (1994). Complex processing and protein:protein
interactions in the E2:NS2 region of HCV. Virology 204,
114-122.

48. Simmonds, P., Holmes, E. C., Cha, T.-A., Chan,
S.-W., McOmish, F., Irvine, B., Beall, E., Yap, P. L.,
Kolberg, J. and Urdea, M. S. (1993). Classification of
hepatitis C virus into six major genotypes and a series
of subtypes by phylogenetic analysis of the NS-5 region.
J. Gen. Virol. 74, 2391-2399.

49. Tanaka, T., Kato, N., Cho, M.-J. and Shimotohno, K.
(1995). A novel sequence found at the 3' terminus of
hepatitis C virus genome. Biochem. Biophys. Res.
Commun. 215, 744-749.

50. Tanaka, T., Kato, N., Cho, M.-J., Sugiyama, K. and
Shimotohno, K. (1996). Structure of the 3' terminus of
the hepatitis C virus genome. J. Virol. 70, 3307-3312.

51. Tsuchihara, K., et al. (1997). J. Virol. 71,
6720-6726.

52. Tsukiyama-Kohara, K., et al. (1992). J. Virol. 66,
1476-1483.

53. Vassilev, V. B., Collett, M. S. and Donis, R. O.
(1997). Authentic and chimeric full-length genomic cDNA
clones of bovine viral diarrhea virus that yield
infectious transcripts. J. Virol. 71, 471-478.




54. Weiner, A. J., Brauer, M. J., Rosenblatt, J.,
Richman, K. H., Tung, J., Crawford, K., Bonino, F.,
Saracco, G., Choo, Q.-L., Houghton, M. and Han, J. H.
(1991). Variable and hypervariable domains are found in
the regions of HCV corresponding to the Flavivirus
envelope and NS1 proteins and the Pestivirus envelope
glycoproteins. Virology 180, 842-848.

55. World Health Organization (1997). Hepatitis C.
Weekly Epidemiol. Rec. 72, 65-72.

56. Yamada, N., Tanihara, K., Takada, A., Yorihuzi, T.,
Tsutsumi, M., Shimomura, H., Tsuji, T. and Date, T.
(1996). Genetic organization and diversity of the 3'
noncoding region of the hepatitis C virus genome.
Virology 223, 255-261.

57. Yanagi, M., Bukh, J., Emerson, S. U. and Purcell,
R. H. (1996). Contamination of commercially available
fetal bovine sera with bovine viral diarrhea virus
genomes: implications for the study of hepatitis C virus
in cell cultures. J. Infect. Dis. 174, 1324-1327.

58. Yanagi, M., Purcell, R. H., Emerson, S. U. and
Bukh, J. (1997). Transcripts from a single full-length
cDNA clone of hepatitis C virus are infectious when
directly transfected into the liver of a chimpanzee.
Proc. Natl. Acad. Sci. USA 94, 8738-8743.

59. Yanagi, M., St. Claire, M., Shapiro, M., Emerson, S.
U., Purcell, R. H. and Bukh, J. (1998). Transcripts of
a chimeric cDNA clone of hepatitis C virus genotype 1b
are infectious in vivo. Virology 244, 161-172.

60. Yanagi, M., St. Claire, M., Emerson, S. U.,
Purcell, R. H. and Bukh, J. (1999). In vivo analysis of
the 3' untranslated region of hepatitis C virus after in
vitro mutagenesis of an infectious cDNA clone. Proc.
Natl. Acad. Sci. USA 96, 2291-2295.

61. Yoo, B. J., et al. (1995). J. Virol. 69, 32-38.

62. Zhao, W. D., Wimmer, E. and Lahser, F. C. (1999).
Poliovirus/hepatitis C virus (internal ribosomal entry
site-core) chimeric viruses: improved growth properties
through modification of a proteolytic cleavage site and
requirement for core RNA sequences but not for
core-related polypeptides. J. Virol. 73, 1546-1554.




WHAT IS CLAIMED IS: 

1. A purified and isolated nucleic acid 
molecule which encodes human hepatitis C virus of 

^ genotype 2a, said molecule capable of expressing said 
virus when transfected into cells. 

2. The nucleic acid molecule of claim 1, 
wherein said molecule encodes the amino acid sequence of 

10 SEQ ID NO: 2. 

3. The nucleic acid molecule of claim 2, 
wherein said molecule comprises the nucleic acid 
sequence of SEQ ID NO:l. 

15 

4. A DNA construct comprising a nucleic acid 
molecule according to claim 1. 

5. A DNA construct comprising a nucleic acid 
molecule according to claim 3. 

6. An RNA transcript of the DNA construct of 

claim 4. 

7. An RNA transcript of the DNA construct of 

claim 5. 

8. A cell transfected with the DNA construct
of claim 4.

9. A cell transfected with the DNA construct
of claim 5.

10. A cell transfected with RNA transcript of
claim 6.

11. A cell transfected with RNA transcript of
claim 7.

12. A hepatitis C virus polypeptide produced 
by the cell of claims 8 or 9. 


13. A hepatitis C virus polypeptide produced 
by the cell of claims 10 or 11. 

14. A hepatitis C virus produced by the cell
of claims 8 or 9.

15. A hepatitis C virus produced by the cell 
of claims 10 or 11. 

16. A hepatitis C virus whose genome
comprises a nucleic acid molecule according to claim 1.

17. A hepatitis C virus whose genome 
comprises a nucleic acid molecule according to claim 3. 






18. A method for producing a hepatitis C 
virus comprising transfecting a host cell with the RNA 
transcript of claims 6 or 7. 

19. A polypeptide encoded by a nucleic acid
sequence according to claim 1.

20. A polypeptide encoded by a nucleic acid 
sequence according to claim 3. 

21. The polypeptide of claim 19, wherein said
polypeptide is selected from the group consisting of NS3
protease, E1 protein, E2 protein or NS4 protein.

22. The polypeptide of claim 20, wherein said
polypeptide is selected from the group consisting of NS3
protease, E1 protein, E2 protein or NS4 protein.

23. A method for assaying candidate antiviral
agents for activity against HCV, comprising:

a) exposing a cell containing the hepatitis
C virus of claims 16 or 17 to the
candidate antiviral agent; and
b) measuring the presence or absence of
hepatitis C virus replication in the cell
of step (a).

24. The method of claim 23, wherein said
replication in step (b) is measured by at least one of
the following: negative strand RT-PCR, quantitative
RT-PCR, Western blot, immunofluorescence, or infectivity in
a susceptible animal.

25. A method for assaying candidate antiviral
agents for activity against HCV, comprising:

a) exposing an HCV protease encoded by a
nucleic acid sequence according to claims
1 or 3 or a fragment thereof to the
candidate antiviral agent in the presence
of a protease substrate; and
b) measuring the protease activity of said
protease.

26. The method of claim 25, wherein said HCV
protease is selected from the group consisting of an NS3
domain protease, an NS3-NS4A fusion polypeptide, or an
NS2-NS3 protease.

27. An antiviral agent identified as having
antiviral activity for HCV by the method of claim 23.

28. An antiviral agent identified as having 
antiviral activity for HCV by the method of claim 25. 


29. Antibody to the polypeptide of claim 19. 

30. Antibody to the polypeptide of claim 20. 

31. Antibody to the hepatitis C virus of
claim 16.

32. Antibody to the hepatitis C virus of
claim 17.

33. A method for determining the
susceptibility of cells in vitro to support HCV
infection, comprising the steps of:

a) growing animal cells in vitro;
b) transfecting into said cells the nucleic
acid of claim 1; and
c) determining if said cells show indicia of
HCV replication.

34. The method according to claim 33, wherein
said cells are human cells.

35. A composition comprising a polypeptide of 
claim 19 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 

36. A composition comprising a polypeptide of
claim 20 suspended in a suitable amount of a
pharmaceutically acceptable diluent or excipient.

37. A composition comprising a nucleic acid
molecule of claim 1 suspended in a suitable amount of a
pharmaceutically acceptable diluent or excipient.

[Nucleotide sequence, positions 251 to 9599: text not legible in this copy.]

FIG. 6F 



H77C

[Amino acid sequence, residues 1 to 1900: text not legible in this copy.]

FIG. 6G 

[Amino acid sequence, residues 1901 to 3011: text not legible in this copy.]

FIG. 6H 

[Nucleotide sequence, positions 1 to 1900: text not legible in this copy.]

FIG. 7A 

SUBSTITUTE SHEET (RULE 26) 



WO 00/75338 



16/22 



PCT/US00/15446 



HC-J4 



10 20 30 40 50 

1234567890 123 4567890 12345 67890 1234567890 1234567890 

T Q SnOO Q ST GIDGCEAOGT ATRQCIQ9GG GGAGAAIIGAG ACAGA03IGA 1950 

TOCTOCTCAA. CAACAOGaGT OOGOCACAAG QCAftCIQSIT 03GCIGEACA 2000 

TOGATCAATA GEACTO3SIT CACTAAGACG TQCJ9GAG3IC OOOOGTOIft A 2050 

CATO3GGGGG GTCQ3EAA0C QCALLTJ.GAT CTQOOOCAGG GACTQCTIOC 2100 

Q3AAGCAO0C OGAQ3Ciacr TOCACAAAAT GTO3CTOGQG QOOCTOSITB 2150 

ACAOCTAGGT OXTAGEAGA CEAOTATAC A UJL ' ITIUUU ACT A OGO l'lli 2200 

CACICICAAT TnTOCATCr TIftAGGTrAG G KlUlftlG ' lG GQGGamUi 2250 

AGCACAG3CT CAA2QGCGCA. T3GAATIGGA. CTOGAGGAGA. G3QCIUIAAC 2300 

TK33AQGACA G93A3K33IC AGAACTCAGC OUUL ' IUL ' IGC TCICDOAC 2350 

AGAGT3XAG ATACIQOOCT GIQCTTICAC CAC0CIAC08 QCTITATCCA. 2400 

CIG3ITIGAT OCATCIOZAT CAGAACATCG T33AOSIQCA ATAOCIGEAC 2450 

GGIGTAGGGT CAQCXHTIGT CrOdTIGCA ATCAAAT3GG AGIACATCCT 2500 

GnGcrmc cticicciqg cagaoqogqg osrareraoc toctigigga. 2550 

TCATOCTOCT GA3AQ00CAG QCIGAQGOCG GCTTftGAGAA CnOQIGSIC 2600 

CIGAAIGGGG 03IOCX3IQQC QQGAGOGCAT GSBflTCTCT (JLTl'lLTlUr 2650 

Gncncroc gocqociost acateaagqg caqx'igxt oaooggoas 2700 

OSTATOCnT TEAIQGOGTA 7GQQ0QCIQC 'lOC ' ICC ' I C CT ACTGQ0STIA 2750 

CCADCAOGAG CTEAQQOCTr QGA003GGAG ATQGCTGCAT OSIGOQGGQS 2800 

TQ QQ GTICIT GTAGSICIQG TATIUTIGAC CTIGTCAOCA TPCDOAAG 2850 

TGfrnCICAC TAGGCICA3A TQGIGSITftC AATACTTTAT CAOCAGAGOC 2900 

GAGQQGCACA TOCAAGIUIG QGIC0000QC CICAACGTIC QQGGAGGOOG 2950 

CGATOOCATC ATCCTOCICA OSIGTOOGGT TCA30CAGAG TEAATITTIG 3000 

ACATCACCAA ACT0CIQCIC QQCATACTOG GQC09CICAT GGTGCTOCAG 3050 

GCIGGCA32A CGAGAGTGOC GDOTOGTC OQOGCICAAG GX'ICATiOG 3100 

TOCAIQCAJG TEAGIGGGAA AAGT0QQQQG QGGICATEAT GTOCAAA1GG 3150 

TCTICATCAA GCIGGGQQOG CIGACAGGTA (Jb'JJALUlTlA TAAOCATCTT 3200 

AC00C A CIQC GQGACTOQQC CCAOQOGGOC CTAOGAGAOC TIQ0QSIQ0C 3250 

Q3TAGAG0QC GTOSICnCT OOQOCATOGA. GA0CAAQ3IC ATCAOCT33G 3300 

GAGCAGACAC O3CIG0GIGT GGGGACATCA TCTIOQGTCT AODOGICIOC 3350 

GCOCGAAG33 GGAAGGAGAT ATm'IUGGA COG3CIGA3A GTCTOGAAGG 3400 

GCAAGQ31GG OGAL'lULTIG CGCCCATCAC GGOCTACTOC CAACAAAOGC 3450 

GQGQGGTACT TOSTTQCATC ATCACIAGCC 1CACAGGGOG QGACAAGAAC 3500 

CAGGTCGAAG GGGAGGTICA AGTOGflTICr ADOGCAACAC AATCITIGCT 3550 

GGCGAOCIGC ATCAAOQQOG TGIQC TOGA C TG7ICEAOCAT G G 0QCIQ 3CT 3600 

QGAAGAOQCT AGCOGGTOCA AAAGGTCCAA TCAOOCAAAT GTACAOCAAT 3650 

GTAGAGCTOG ACCTOGTOGG CTO3CAGGOG OCO0O3O33 G CGOQCTOCAT 3700 

GACAQCATGC AGCT3TGGCA GCIOGGAOCT TDOTOGIC AOGAGACATC 3750 

CTGATGTCAT T00GGT90GC 03GOGAGGGG ACAGCAGGGG AAGTCTACTC 3800 
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tqooocaggc uu mcic cm ocigaaaggc toctogqgig giojatiult 3850 

Trocmnas gsgcaggtce Tggasicrr googgcigct gtcigcbooc 3900 

G GSGG STO GC GAAGS03GIG GALT1UATBC OOGTIGAGIC TATOSAAACT 3950 

AOCATOOGST ClUJGGldT CACAGACAAC TCAAOOXDC O3X1GIA0C 4000 

GCAGACATIC CAAGIGSCAC ATCIQCAOQC TOCHJOGGC AGG3GCAAGA 4050 

QCAOCAAAGT QQOGGCIGOG TA1QCAGO0C AAGGGTACAA Q3IGCTO3TC 4100 

CIGAAO00GT (JUbTiUULUL 1 CAQCTiaGGG TriU U U UO G T AIATCKERA 4150 

QGCACA033T ATOGACOCIA ACATCAGAAC TOQGGIBAQS ADCATUCCA 4200 

CUGGGGXIC CAIITOOGriaC TOCACCTATC GGAAUl'lULT TOCX33A033T 4250 

QQCIGTICIG G3GGO30CIA T3ACATCAHA A3Mt3IGA3G AGTOOCACTC 4300 

AACIGACIOG ACTOOCATCT T333CMD33 CBCAGTOCIG GAOCAAO033 4350 

AGA033CIGG AQ0G09GCIC UlUb ' lUL ' lUG CCAOOQCIM: AOCTO0393A 4400 

'l CUb Ti^ OOS 1GOGAGAQQC CAATATOGAG GAAA1AQ3X TGTOCAACAA, 4450 

TQGAGAGATC CULTiUIATC GCAAAOXAT CCXX30TGM3 GOCA3CAAGG 4500 

GGG9GAQQCA TCICATITIC TOOCATICCA AGAAGAAAIG 1GAOGAGCTC 4550 

GOOQCAAAOC T3ACAQG0CT OQGACIGAAC GCIGEAOCAT ATTAOOGGQG 4600 

OCTIGATGIG TQQGTCA3AC COXT AI OGG AGACGTOGTT GT0GIQ3CAA 4650 

CAGACGCICT AATC A QQQGT TICAGQGGOG ATOTGACIC AGTGATOGAC 4700 

TQCAATACAT GI13ICAO0CA GACAGIOGAC TTCAGCTIGG ATOOCACCIT 4750 

CAOCATIGAG AOGAOGAOOG T3QCOCAAGA OQ03SIGTOG 03CIO3CAAC 4800 

O3CGA0GTAG AACIQQCAGG QGTAGGAGIG GCATCBOG GTTIGIGACT 4850 

OCAGGAGAAC GG00CIO3QG CATCTTOGAT TCTTOGGTOC TSTGIGAGIG 4900 

CTAIGADQOG GGCIGIGCTT GGTATCAGCT CAOGOOOQCT GAGAOCIOSG 4950 

TIAGGTIQOG GGCTEftDCTA AATACACCAG G3TIQQQ0GT CIGOCAQGAC 5000 

CATCIGGAGT TCIGGGAGAG CGICTICACA G30CTCA00C ACATAGAT3C 5050 

CCACTIGCIG TOCCAGACTA AACAGQCAGG AGACAACnT OCITAOCIGG 5100 

TQGCATATCA AGCEWZAGIG TGOGOCAGGG CICAAGCTCC AOCIQCAT0G 5150 

TGGGACCAAA TGTOGAAGIG TC1CATACX33 CIGAAAOCm CACIGCAOQG 5200 

GOCAACAO0C CHjC I GTft XA GGCIAGGAGC OGTGCAAAAT GAGb'lCAlOC 5250 

TCACACAOOC CAIAACIAAA TACATCA1GG CATOCAIGTC GGCIGADCIG 5300 

GAGGTOGTCA CEfiGCAOCIG GSIGCTG5 E A GG0GGAGT0C T1UUAULT1T 5350 

GG003CATAC TGQCIGAOGA CAGGGAGIGT OSICATIGIG GGCAGGATCA 5400 

lUnGIC C GG GAAGOCAGCr UlUbTiUO OG ACAQGGAAGT OOICIAOCAG 5450 

GAGTTOGAIG AGATGGAAGA GIGIGOCICA CAALTlUL'iT ACATO3AGCA 5500 

GGGAATQCAG CTOGOCGAGC AATICAAGCA AAAGQOQCIC GUUTIUI'IUC 5550 

AAAGQGCCAC CAAGCAAGOG GAGGCIGCIG CTOOOGTGGT GGAGTG CAAG 5600 

TQQQGAGOQC TTGAGAOdT CIGG90GAAG CACAIGIGGA ATI'ICAICAG 5650 

OGGAA3ACAG TAOCTAGCAG GCTEATOCAC TCIGOCIGGA AAOOOOQOGA 5700 
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TAQCATCArr GKIQGCKITr ACAULT1C1A TC2CTPGCCC GCVCACCPEC 5750 

CAAAACROOC TOCIGITD A CftTCTIQ333 QGM3GGSIQS CIGQGCftftCT 5800 

oGcrocrooc agoqcioogt cagctttogt gggoqooqsc atosooqgag 5850 

OQGCIGIT3G CAGGATAG3C CITO3G&AGG TQCTOGflQGA C MCTIGGOS 5900 

GQCTATOGOS CfiQ333TAQC: OGQOSGACIC GP33ULT1'1A AGSICMGAG 5950 

QGQOGAGSIG OOCIOGAOGS AGGAGC3GGT CAACmOC OL'IUULKPOC 6000 

TOCICCIGG TCOJL'lOjlC GIOGGGSIC G TCIGOQCAGC AMSOGOGT 6050 

OQGCAOSTGG GCXXX333AGA G33QQCIGIG CAGTO3A1GA. XX333CX2 K T 6100 

AQ35TT0GCT T0SOSQ3GTA, AaC ft CJG?lC'lC OOCiaOQCAC T AIUIUJUL ' IU 6150 

JOGOSACOC IGGRGCaOSr GiaOCAGA. TOCTCTCia G CXJTOOCaTC 6200 

ACICAACIQC TGAAGOGGCT OCAOCAGIG3 ATI&ATGAGG ACIGCILTAC 6250 

QOCKKJCIOC Q3CimiGGC' TAAQ33A1GT TTOGGAITOG KTKK3CP03G 6300 

TCITCACIGA CTICAAGAOC TOGClUCft CT OCAAACIOCT G O O QOGSnA 6350 

O033GAGIDC CTTIDCIGIC ATOOCAA03C GQSIBCAAGG GA CTCIQ30G 6400 

Q3033A0Q3C ATCATOCAAA OCACCIQOOC ATOOGGAGCA. CAGATOGOOG 6450 

GACATCKAA ARflO GGTIO C ATGAGGATOG TAQ330CTAG AAOCTOCAQC 6500 

AACAOGTOSC AQ33AA03TT QOOCATCAAC QCA3»CAOCA OGQGAOCTXG 6550 

CACftCCClCC OQGGQGOGCA ACEATTCCAG GGOGCEATQG QOa a ULl'lli 6600 

CIGAQ3AGIA 0GIGGAG3TT AOG0GIGTO3 QQGKIT1ULA CIBOGTGAOG 6650 

GQCATCAQCA CT3ACAAOGT AAAGIQQOCA TQOCAGSnC CQ3QQ000GA 6700 

ATICTICAOG GAGSIOGATC GMUXXX33TT GCACAG3EAC GCT0GQQQGT 6750 

GCAAACCICT TCIA03GGAG GAQGTCAOGT TCCAGGID33 GCICAAOCAA 6800 

TACTIGGT0G GGTQQCAGCT COCA3QDGAG QOQGAA003G AGGTAACAGT 6850 

GCTDOT0C ATQCICACOG AlCO-'lOXA CATBOGCA GAGAOQ3CIA 6900 

AQCGEAG3CT QGCIAGAGGG TCTOCXXXCT CITIAGOCAG CICATCAGCT 6950 

ASQCAGTIGT CIQQQOCTIC TTIGAAQQ03 ACA3GGACIA Q0CACCA3GA 7000 

CKXX30QGAC QCTGAOCICA TOGAQ3XAA OCICTIGIOG OQ3CAQGAGA 7050 

TOGG03GAAA CATCACKDQC GIGGAG?ICAG AGAATAAGGT AGIAATICIG 7100 

GA CTCTITGG AAOOQCTICA QQQQGAQGGG GATGAGAGGG AGAIATOD3T 7150 

0G0G9QQ3AG AT0CIGO3AA AATOCAGGM GTIOXCICA G0GTTG00CA. 7200 

TATGGGCA03 CC03GACTAC AATCCTOCAC TOCIAGAGIC CIQGAAGGAC 7250 

OOQGACIAOG TOOCIOOGGT GGTACA033A tqoocatigc CAOCTACCAA 7300 

QGCTCCIOCA ATAOCAOCIC CAOQGAGAAA GAGGAOGGTT GTCCIGACAG 7350 

AATCCAATCT GILT1U1UOC TTQQGQGAGC T0Q0CACIAA GALL.T1UJGT 7400 

AQCTCXDQGAT OGTOGOQCGT TGATAG0G3C AO330GAO0G OOCnOCTGA 7450 

OCIGGO CTO C GA0GAQQ3IG ACAAAGGATC OGA UUl ' lUA G U US flO O CT 7500 

CCATOOO0QC OCTIGAAQQG GAGGOGGGGG AO300GATCT (2G0GAOQS3 7550 

TCTIGGTCIA COGTCAGTCA GGAG3CEAGT GAOGAIGIOG TCTOCIQCIC 7600 
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MTCTOCTAT AOGIG3ACAG G03XCIGAT CAGQ0CA3GC GCTOOQGAGG 7650 

AAAG1AAGCT G30CATCAAC 003TIGAGCA ACICITIQCT QOSICA QCAC 7700 

AACATOSTCT AOXCACAAC ATO00QCW3C GCAAGOCIOC QQCAGAAGAA. 7750 

QSICAOCrrr GACAGATIQC AAGTOCIGGA T3A1CAIEAC OSQGAQGTAC 7800 

TCAAQ3AGAT GAAG90GAAG GCGTOCACAG TEAAGGCIBA. QCTICEA1CT 7850 

AOBGAGGAGS (X'IGCAAQCT GACQ00OCCA CATKX390CA AATOCAAATT 7900 

TOQCPflGG G GCAAAGGAGG TOOQGAACCr KTCCN3CM33 GGCUl'lAAOC 7950 

AC KIOJUC'I C QGIGIGGGAG GACTIGCIQ3 AAGACACIGA, AACAQCAATT 8000 

GACADCAOCA TCATOGCAAA AA GIGAQ3TT TTCIQC3SICE AAOCAGAGAA 8050 

O9GAGQG0QC AAGOCAGCTC QOCTIMOGT ATIOXAGAC CIQ3GAGTIC 8100 

CTGIMGO GA GAAGATO30C CTTEAOGAGG TO3ICICCAC (JLTlUL'iUAG 8150 

QQ0GIGA3G3 GCIDCICATA. OQGATTICAA TACIOOGCCA. AGCAGCQ33T 8200 

OGA STTOCIG GItaAATAOCT QGAAATCAAA GAAAIQQCCT A!1G J J L T1 L T 8250 

CATAIGACAC OOQL'lUl'l'l'l' GACTCAA033 TCACIGAGAG TGACATICGT 8300 

GTIGAGGAGT CAATmOCA A3GTIGIGAC TiOGGOOOOG AGQOCAGACA 8350 

Q50CA2AAGG T0GCIC A CAG AQOOXTTIA CATO9393ST OQOC1GACTA 8400 

ACTCAAAAGG GCAGAACIGC GGTTAICGCC QGT3CGGCGC AAGIGGGGTG 8450 

CTGAOGACIA GCIOQGGTAA TAOX3CACA TSTEOTGA AG33CAC3GC 8500 

AGCCTGTQGA GCIGCAAAQC TOCAGGACIG CAGGAIQCIC GIGAAOQGAG 8550 

ACGACCTIUT OSITAICTGT GAAAGOQOQG GAACCCAGGA GGAIGCG3GG 8600 

QCX3CTACGAG OCITCAOGGA GGCEA1GACT AGGTATICGG CCCCCCCCG3 8650 

GGATCQGCCC CAAOCAGAAT ACGAGCIGGA GCIGA1AACA TCAIUl'lUCT 8700 

OCAA1GIGIC AGTCGCGCAC GAJGCATCIG GCAAAAGGGT A3ACIACCTC 8750 

ACCCGTGACC CCACCACCCC UJl' l GCA OGG GCTG0STG93 AGACAGCTAG 8800 

ACACACIGCA ATCAACICIT GGCTAGGCAA TATCAICAIG UflGOSOGCA. 8850 

CCCTATGGGC AAGGAIGATT CIGATCACIC ACTTITICTC CATOLTIL'IA 8900 

GCICAAGAGC AACTIGAAAA AGCCCTOGAT TCICAGATCT ACGGGGCTTC 8950 

CTACTOCATT GAGCCACTIG AOCIAOCICA GATCAHGAA GGACTOCAIG 9000 

G1UTTAG0QC ATITACAC1C OOGTEACT CTQCAG3TGA, GA3CAATAGG 9050 

GTOGCnCAT GOCICAGGAA ACTIGGGGTA CCACCCTIGC GAAGCIGGAG 9100 

ACATOGGGCC AGAAGIGTCC GCGCTAAGCT ACIGIOOCAG GQQQQGAGGG 9150 

OCGXACTIG T3QCAGATAC CICTITAACT QGGCAGBVAG GAOCAAGCIT 9200 

AAACICACIC CAA2O00QGC CGCGTCCCAG CIGGACTIGT CIGGCIGGTT 9250 

CGIUXIGG T TACAQCGQQG GAGACATATA. TCACAGGCIG TCICGIQQCC 9300 

GAOCCGGCIG GTTICCGTIG TOQCTACICC TACITICIGT AGG33TAGQC 9350 

ATTIAOCTGC TCQQCAACCG ATGAACG3GG AGCTAACCAC TCCAGGOCIT 9400 

AAQocATnc ciGnnnT TnTrrrrrr TrmnnT Tcrnnnr 9450 

Tncrnocr TrocncnT Trrrocrnc Trrnarrr ctteaaiggt 9500 
Q3CroCATCT TBUUULTAGT CMJ33CJ2G2 T3IGAMG3T O0SIGRGQ08 9550 
CAIGACTOCA GAGM3IGCHG M7OQ30CT CTCIQCAGKr CATGT 9595 

FIG. 7F 
MSINFKFQRK 1KKNINRRPQ DVKFFG93QI W33VXLLPHR GERLGVRA3R 50 

KASERSQHS3 KRQPIPKARR PEERAWAQPG YPWPLYGNEG LGWftGWTJSP 100 

FGSRPSWGPT EEEKRSRNLG KVliUlUUGF AEELM3YIPIA7 GAPH33AARA 150 

IAH3VRVLED GWKA3GNLP GCSFSUIIA LL9CLTTPAS MEVRNV93I 200 

YHVINDCSC SIVYEAAEWI MHTPGCVFCV QB3^SSRCWV AUTPILAARN 250 

ASVPTITIER HVDLIM3EAA PCSAMA^SX. 03SIFLVSQL FIFSH^RHET 300 

VDPOCSIYP GHVSGHRMAW EfcMWWSPTT ALWSQE1RI FQWVCMSfcG 350 

AHWGVLAGLA YY3WGNWAK VLIVALLEAG VOGEIHTIGR VAGHTISGFT 400 

SLFS9GA9QK IQLVN3N3SW HINKEALN3J DSLQIGFEAA IFffiHKFNSS 450 

QCFEKMA9CR PIEWF^QQWG PTIYIKFNSS DQRFXOWHYA PRPCGWEAS 500 

GMJGPVXCFT PSPWVUi'lD RSGVPIYSWG ENEItWMLIN NIRPFQ3WF 550 

GCIVMNSIGF 1RTOQGPECN IG3W33KILI CFIDCERKHP EKEYIKDS9G 600 

PWLTPRCLVD YFYRLWHYPC TLNFSIFKVR MYW33VEHRL NftAQWIKS 650 

FOJLHDRDRS ELSPUISTT EWQILPCAFT TLPALSTGU HEH3NIVEM2 700 

YLYGM3SAFV SEAIKWEYIL LLFIIIADAR VCAOJM^Hi IAQAEAALEN 750 

LWLNAASVA GAB3ILSFLV FTCAAWYIKG KLAFGAA5ZAF YGVWPLLLLL 800 

LALPPRA¥AL EREMAASOQG AVUVGLVFLT LSPYYKVFLT FLBftWLQYFT 850 

TRAEAHM2VW VPPIMVRGGR DAIILLTCAV HPET.TFDUK LLLATTGFLM 900 

VLQAGTTRVP YFVRAQGLIR ACMLVRKVAG GHTOWFMK D3ALTCTWY 950 

NHLTPLJ3DWA HAGLEDLAVA VEPWFSAME TKVTIWGADT AAGGUTLLGL 1000 

PVSARRGKEI FIGPADSLEE CGWRLLAPIT AY9QQIK3VL GCUTSLTGR 1050 

EKNgVEGEMQ WSIMQSFL ATCINSVCWT VYH3AGSKIL AGPKGPITQM 1100 

YTNVEOXVG WQAPPGARSM TPCSOGSSDL YLVIEHAEVI PVRRRESRG 1150 

SLLSERPVSY IKGS9GGPLL CP9GHWGVF RAAVCIPGWA KAVEFUVES 1200 

METIMRSPVF TCNSTPPAVP CflFQVAHLHA PTGSGKSIKV PAAYAAQGYK 1250 

VLVLNPSVAA HjGFGAXMSK AHGHJPNIFT GVKI'JLTILUS nYSTXGKFI, 1300 

ADGQC9QGAY DIIICCECHS TOSTTILGIG TVLDQAEEH3 ARLWLATAT 1350 

PFGSVIVPHP NXEEEGLaON GEZPFYGKAI PIEAIKGGRH UFCHSKKKC 1400 

EELAAKLT3L GLNAVAYYPG UVSVIPPIG DWWAIDAL MIGPTOCFDS 1450 

VIIXMICVIQ 1VEFSU3PIF TlKmVHJD AVSRSQpRGR TGRggGKR 1500 

FVIPGERPSG MFDSSVLCEC YDAGCAWYEL TPAEISVRLR AYLNIPGLPV 1550 

OQEHLEFWES VFIGLTHUA HFLSQIKQAG ENEPXLVA2Q A3VZARAQAP 1600 

PPSWDCMWKC LIRLKPILH3 PTPLLYPLGA V3NEVHJIHP nKHMACMS 1650 

ADLEWTSIW VLVGGVLAAL AAYCLTIGSV VIU3RIIISG KPAWPEREV 1700 

LYQEFEEMEE CASQLPYIEQ GM3LABQFKQ KAIGLLGTAT KQAEAAAPW 1750 

ESKWRALEIF WAKHMftNFIS GIQYLAGLST LPGNPAIASL MAFTASTTSP 1800 

LTT3mUN ILGGWVAAQL APPSAASAFV GAGIAGAAV3 SIGDGKVLWD 1850 

ILAGYGAGVA GALVAFKVMS GEVPSIUXV NLLPAILSPG ALWGWCAA 1900 
10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

ILHRHVGFGB GAVQWNRLI AEASKGNHVS PIHWPESEA AftKVIQUSS 1950 

urrram<RL hswinedcst pcsgswl^d/ vewicivltd fkiwdqskll 2000 

PRLPGVPFLS OQRGYKEVWR GDG3M2ITCP OGftQIAGHVK NG94RIW3PR 2050 

TCa^HGTIF PINAYTIGFC TPSPAHWSR ALWRVAAEEY VEVIRUGCSH 2100 

YVTGMTHNV KCPO07PAPE FFIEVD3VRL HKXAPAOKPL LRED/DF^/G 2150 

UJjm/G33L PCEFEFD/IV LTSMLTDPSH nMH^KRHL ARGSPPSLAS 2200 

SSASCLSAPS IXATCTIHHD SPDMUEBN I1W<EM3GN riEVESENKV 2250 

vrrrsFEPiH abxekeisv aaetl^ksrk fpsalpiwar peknppues 2300 

VKDPDWPP7 VH3CPLPFIK APPIPPPRRK F3WDIESN7 SSALAELATK 2350 

1FGSS3SSAV LbUiMMfD IAS'I I G H3S EWESXSSMPP LEEEPGDHX 2400 

SDGSWSIVSE EASEHWCCS MSXTWIGMJ TFCAAEESKL PINELSNSLL 2450 

SRSASLPQKK VTFERU^LD EHXFEWLKEM KAKASIVKAK 2500 

LLSIEEACKL TPFHSAKSKF GYGAKEVRNL SSRAVNHIRS VWEELUDIE 2550 

TP1ULT1MAK SEVPCVQPEX GGRKPARLIV FPDU3VFWCE KMMjEDWST 2600 

LPQAVMSSSY GPQYSPKCKV EFLVNIWKSK KZPM3FSYDT RCFESIVIES 2650 

DIRVEESIYQ OOXAFEARQ AIESUEERLY IQGPLTNSKG QXJHBBCBfL 2700 

SGrVLTISQGN TL1UYLKAIA. ACPAAKLQDC VICESAGIQE 2750 

EAAALRAFIE AMTRYSAPPG DPPQPEYELE LTISCSSNVS VAHEftSGKRV 2800 

YYLTRDPTTP LARAAWEEAR HTPINSWLGN IDttAFIim RMUMZHFFS 2850 

IHAQEQLEK ALDC3QIYGAC YSIEPLDLPQ IHKLH3LSA FILHSYSPGE 2900 

INRVASCLRK U3VPPLKIWR HRARSVRAKL L9Q9GRAKIC GFQlfNWAVR 2950 

TKLKLTPIPA ASQLDLSQWF VAGY33X»IY HSLSRARPPW FPLCLLLLSV 3000 

GVKSIYLLfNR 3010 
<110> Yanagi, Masayuki 
Bukh, Jens 
Purcell, Robert 

<120> Cloned Genome of Infectious Hepatitis C Viruses of 
Genotype 2a and Uses Thereof 

<130> 20264302PC 

<140> TBA 

<141> 2000-06-02 

<150> 60/137,693 
<151> 1999-06-04 

<160> 39 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 9711 
<212> DNA 

<213> Hepatitis C virus 



<400> 1 



acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 
cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 
tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360 
tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 420 
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480 
cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 
agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600 
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 
gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 720 
gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 
cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 
gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 
cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 
caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020 
cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 
cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 
gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 
agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 
catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 
gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 
cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 
gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 
tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 
caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620 
gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 
caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 
gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 
ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 
agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 
cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 
gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 
cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 
ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 
cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 
ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280 
atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 
tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 
cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 
tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 
cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 
ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 
tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 
ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 
gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 
ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 
gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2940 
tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3000 
tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 
aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 
catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 
gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 
cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 3300 
cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 
cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 
gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 
agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3540 
cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 
tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 
tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 
cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 
aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3840 
gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 
tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 
cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 
tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 
atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 
gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 
gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 4260 
gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 
taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 
aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 
ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 
ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 
cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 
ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 
gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 
cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 
acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 
cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4920 
aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 
caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 
cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 
cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 
ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 
tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 
atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 
agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 
cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 
ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 
gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 
gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 
aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 
cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 
cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 
cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 
gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 
ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 
ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 
gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 
ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 
catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 
catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 
ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 
gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 
aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 
ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 
tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 
ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 
ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 
gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7020 
tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 
gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 
gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 
ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 7320 
aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 7380 
ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 7440 
gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 7500 
ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 7560 
gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 7620 
gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 7680 
ctggaccggg gctctaataa ctccttgtag tcccgaagag gagaagttac cgattaaccc 7740 
cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 7800 
ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 7860 
ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 7920 
ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 7980 
taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040 
cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 8100 
ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 8160 
cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 8220 
ggtgatgggg gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 8280 
gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 8340 
ctcaaccgtc actgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 8400 
gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 8460 
tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 8520 
caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 8580 
tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg tcatctcaga 8640 
aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 8700 
gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 8760 
ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 8820 
cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 8880 
caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 8940 
gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 9000 
tgagatgtac ggagcggtgt actccgtgag tcccttggac ctcccagcta taattgaaag 9060 
gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 9120 
ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 9180 
tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 9240 
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 9300 
ggatttatcc agctggttca ccgtcggcgc cggcgggggc gacatttatc acagcgtgtc 9360 
gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 9420 
tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 9480 
cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 9540 
tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 9600 
tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 9660 
gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 9711 
<210> 2 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 2 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 
50 55 60 

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro 
210 215 220 
Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln 
245 250 255 

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys 
290 295 300 

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 
Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp 
705 710 715 720 

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 
Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val Ile Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val Ile Phe Phe Val Ala Ala Trp Tyr Ile Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala 
805 810 815 

Ser Val His Gly Gln Ile Gly Ala Ala Leu Leu Val Met Ile Thr Leu 
820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Arg Phe Leu Trp 
835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Val Gln Glu Trp 
850 855 860 

Ala Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala 
865 870 875 880 

Val Ala Ile Phe Tyr Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu 
885 890 895 

Leu Ala Val Leu Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg 
900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 
915 920 925 

Ala Arg His Leu Ala Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala 
930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met 
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala 
980 985 990 
Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Leu Gly Arg Glu Val Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 
1010 1015 1020 

Lys Gly Trp Ser Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Thr Ile Val Val Ser Met Thr Gly Arg Asp Lys 
1045 1050 1055 

Thr Glu Gln Ala Gly Glu Ile Gln Val Leu Ser Thr Val Thr Gln Ser 
1060 1065 1070 

Phe Leu Gly Thr Ser Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met 
1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys 
1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp 
1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 
1235 1240 1245 
Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Val Thr Thr Gly Ala Pro Ile Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ser Thr 
1315 1320 1325 

Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly 
1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu 
1365 1370 1375 

Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser Tyr Ile Lys Gly Gly 
1380 1385 1390 

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ser Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val Ile Pro Thr Gln Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile 
1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gln Val Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr Ile Thr Thr Gln Ile Val Pro Gln Asp Ala Val Ser Arg 
1475 1480 1485 

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg 
1490 1495 1500 
Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ser Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly 
1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Thr Ala Tyr Gln Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 
1635 1640 1645 

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Cys Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala 
1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile 
1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser 
1730 1735 1740 

Lys Gln Ala Gln Asp Ile Gln Pro Thr Val Gln Ala Ser Trp Pro Lys 
1745 1750 1755 1760 
Val Glu Gln Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile 
1765 1770 1775 

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 
1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 
1795 1800 1805 

Thr Thr Ile Leu Leu Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile 
1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile 
1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro 
1875 1880 1885 

Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala 
1890 1895 1900 

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met 
1905 1910 1915 1920 

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 
1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu 
1940 1945 1950 

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile 
1955 1960 1965 

Thr Glu Asp Cys Pro Ile Pro Cys Gly Gly Ser Trp Leu Arg Asp Val 
1970 1975 1980 

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Val Ser Cys Gln 
2005 2010 2015 
Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg 
2020 2025 2030 

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met 
2035 2040 2045 

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Ile Trp Gln Gly Thr Phe 
2050 2055 2060 

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Val Pro Lys Pro Ala Pro 
2065 2070 2075 2080 

Asn Phe Lys Val Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 
2085 2090 2095 

Val Thr Gln His Gly Ser Tyr His Tyr Ile Thr Gly Leu Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Val Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp 
2115 2120 2125 

Val Asp Gly Val Gln Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe 
2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu Met 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Thr Ala Ala Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Lys 
2210 2215 2220 

Ala Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Asp 
2225 2230 2235 2240 

Val Thr Arg Ile Glu Ser Gly Ser Lys Val Val Val Leu Asp Ser Leu 
2245 2250 2255 

Asp Pro Met Val Glu Glu Arg Ser Asp Leu Glu Pro Ser Ile Pro Ser 
2260 2265 2270 
Glu Tyr Met Leu Pro Lys Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 
2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 
2290 2295 2300 

Asp Tyr Gln Pro Ala Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 
2325 2330 2335 

Glu Asp Ser Ile Gly Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe 
2340 2345 2350 

Gly Gln Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Gly 
2355 2360 2365 

Ala Ala Asp Ser Gly Ser Gln Thr Pro Pro Asp Glu Leu Ala Leu Ser 
2370 2375 2380 

Glu Thr Gly Ser Ile Ser Ser Met Pro Pro Leu Glu Gly Glu Leu Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gln Val Glu Pro Gln Pro Pro Pro Gln 
2405 2410 2415 

Gly Gly Val Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 
2435 2440 2445 

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 
2450 2455 2460 

Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Thr Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 
2485 2490 2495 

Asp Arg Met Gln Val Leu Asp Ser Tyr Tyr Asp Ser Val Leu Lys Asp 
2500 2505 2510 

Ile Lys Leu Ala Ala Ser Lys Val Thr Ala Arg Leu Leu Thr Met Glu 
2515 2520 2525 
Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 
2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Glu Thr Pro Ile 
2565 2570 2575 

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr 
2580 2585 2590 

Lys Gly Gly Lys Lys Ala Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly 
2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu 
2610 2615 2620 

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gln Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 
2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
2660 2665 2670 

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Arg Ala Cys Ser Leu Pro 
2675 2680 2685 

Glu Glu Ala His Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val 
2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr 
2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala 
2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser 
2755 2760 2765 

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 
2770 2775 2780 
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Met Thr Arg Tyx Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 
2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 
2820 2825 2830 

Pro lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 
2835 2840 2845 

Ser Trp Leu Gly Asn lie lie Gin Tyr Ala Pro Thr lie Trp Ala Arg 
2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Ala Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ala Val Tyr Ser Val 
2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 
2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 
2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 
2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 
2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 
2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg
2995 3000 3005

Ala Arg Pro Arg Leu Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly
3010 3015 3020

Val Gly Leu Phe Leu Leu Pro Ala Arg
3025 3030

<210> 3
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 3 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360 
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420 
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 540 
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600 
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660 
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 720 
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780 
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 840 
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900 
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960 
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020 
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080 
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140 
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2040 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2340 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580
aggccgaagc agcactagag aagctggtca tcttgcacgc tgcgagcgca gctagctgca 2640
atggcttcct atattttgtc atctttttcg tggctgcttg gtacatcaag ggtcgggtag 2700
tccccttagc tacctattcc ctcactggcc tgtggtcctt tagcctactg ctcctagcat 2760
tgccccaaca ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2940
ccctcaacgt ccgggggggg cgcgatgccg tcatcttact catgtgtgta gtacacccga 3000
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 3300
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 3420
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 3480
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3840
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3960
caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 4020
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 4200
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4320
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 4380
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440
aggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4620
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680
gctttaccgg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 4740
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4800
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4860
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4920
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4980
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 5040
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 5280
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 5340
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 5400
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 5460 
aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 5520 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 5640 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 5820 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 5940 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 6300 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 6360 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 6480 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 6540 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
aggaggtatc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 6840 
agcccgaacc ggacgtagcc gtgttgacgt ccatgctcac tgatccctcc catataacag 6900 
cagaggcggc cgggagaagg ttggcgagag ggtcaccccc ttctatggcc agctcctcgg 6960 
ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 7200
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7320
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 7380
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 7440
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 7740
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 7800
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7860
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7920
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 8040
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220
tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280
gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340
gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400
ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460
gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520
gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580
tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640
tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700
taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760
accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820
ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880
tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940
ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000
ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060
atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120
gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180
agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240
gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300
gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360
taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420
catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480
ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540
gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600
gcagatcatg t 9611



<210> 4 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro
100 105 110

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala
180 185 190

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr
195 200 205

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro
210 215 220

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile
225 230 235 240

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln
245 250 255

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys
260 265 270

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala
275 280 285

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys
290 295 300

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr
325 330 335

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His
340 345 350

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp
355 360 365

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr
385 390 395 400

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr
405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn
435 440 445

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala
450 455 460

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn
465 470 475 480

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 



His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr
595 600 605

Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
610 615 620

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser
660 665 670

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala
675 680 685

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln
690 695 700

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp
705 710 715 720

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
725 730 735

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu
740 745 750

Glu Lys Leu Val Ile Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly
755 760 765

Phe Leu Tyr Phe Val Ile Phe Phe Val Ala Ala Trp Tyr Ile Lys Gly
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Leu Asp Thr
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr Ile Ser Trp Cys Met Trp
835 840 845

Trp Leu Gln Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His Val Trp
850 855 860

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val Ile Leu Leu
865 870 875 880

Met Cys Val Val His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu Leu
885 890 895

Leu Ala Ile Phe Gly Pro Leu Trp Ile Leu Gln Ala Ser Leu Leu Lys
900 905 910

Val Pro Tyr Phe Val Arg Val Gln Gly Leu Leu Arg Ile Cys Ala Leu
915 920 925

Ala Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Ile Ile Lys
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu Ile Thr Trp Gly Ala
980 985 990

Asp Thr Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro Val Ser Ala
995 1000 1005

Arg Arg Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Met Val Ser
1010 1015 1020

Lys Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040

Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys
1045 1050 1055

Asn Gln Val Glu Gly Glu Val Gln Ile Val Ser Thr Ala Thr Gln Thr
1060 1065 1070

Phe Leu Ala Thr Cys Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly
1075 1080 1085

Ala Gly Thr Arg Thr Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met
1090 1095 1100

Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly
1105 1110 1115 1120

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser
1140 1145 1150

Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile
1185 1190 1195 1200

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp
1205 1210 1215

Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln Val Ala His Leu
1220 1225 1230

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser Pro Ile Thr
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ala Thr
1315 1320 1325

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu
1365 1370 1375

Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly
1380 1385 1390

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala
1395 1400 1405

Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly
1410 1415 1420

Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ser
1425 1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile
1445 1450 1455

Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro
1460 1465 1470

Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala Val Ser Arg
1475 1480 1485

Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly Ile Tyr Arg
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly
1555 1560 1565

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly
1570 1575 1580

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600

Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp Lys Cys Leu Ile
1605 1610 1615

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu
1620 1625 1630

Gly Ala Val Gln Asn Glu Val Thr Leu Thr His Pro Ile Thr Lys Tyr
1635 1640 1645

Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val Ile Val Gly Arg Ile Val Leu Ser Gly Lys Pro
1685 1690 1695

Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Gln Glu Phe Asp Glu Met
1700 1705 1710

Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln Gly Met Met Leu
1715 1720 1725

Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Ser
1730 1735 1740

Arg His Ala Glu Val Ile Thr Pro Ala Val Gln Thr Asn Trp Gln Lys
1745 1750 1755 1760

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile
1765 1770 1775

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala
1780 1785 1790

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly
1795 1800 1805

Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala Gln Leu
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp Ile Leu
1845 1850 1855

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Ile
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro
1875 1880 1885

Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala
1890 1895 1900

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr
1925 1930 1935

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile Leu
1940 1945 1950

Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu His Gln Trp Ile
1955 1960 1965

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile
1970 1975 1980

Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys
1985 1990 1995 2000

Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe Val Ser Cys Gln
2005 2010 2015

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly Ile Met His Thr Arg
2020 2025 2030

Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys Asn Gly Thr Met
2035 2040 2045

Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe
2050 2055 2060

Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

Ile Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp
2100 2105 2110

Asn Leu Lys Cys Pro Cys Gln Ile Pro Ser Pro Glu Phe Phe Thr Glu
2115 2120 2125

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu
2130 2135 2140

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val
2145 2150 2155 2160

Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr
2165 2170 2175

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Gly Arg
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp
2210 2215 2220

Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu
2225 2230 2235 2240

Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Val Val Ile
2245 2250 2255

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val
2260 2265 2270

Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly
2385 2390 2395 2400

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 



Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro Ile Asn
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp Arg
2465 2470 2475 2480

Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu Lys Glu Val Lys
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His Ile Asn
2530 2535 2540

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro Ile Asp Thr
2545 2550 2555 2560

Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys Gly
2565 2570 2575

Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val Arg
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln Arg
2610 2615 2620

Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly
2625 2630 2635 2640

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp
2645 2650 2655

Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln
2660 2665 2670

Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr
2725 2730 2735

Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly
2740 2745 2750

Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr
2755 2760 2765

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu
2770 2775 2780

Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp Ala Arg Met Ile
2835 2840 2845

Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu
2850 2855 2860

Gln Ala Leu Asn Cys Glu Ile Tyr Gly Ala Cys Tyr Ser Ile Glu Pro
2865 2870 2875 2880

Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly Leu Ser Ala Phe
2885 2890 2895

Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg Val Ala Ala Cys
2900 2905 2910

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala
2915 2920 2925

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala Ile
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro Ile Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr
2965 2970 2975

Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val Ser His Ala Arg
2980 2985 2990

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly
2995 3000 3005

Ile Tyr Leu Leu Pro Asn Arg
3010 3015 



<210> 5 
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 5 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccgg 180 
gaagactggg tcctttcttg gataaaccca ctctatgccc ggccatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gcgttgggtt gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360 
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420 
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 540
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600 
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660 
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 720 
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780 
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 840 
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900 
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960 
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020 
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2040 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2340 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460 
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520 
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580 
aggccgaagc agctttggag aacctcgtaa tactcaatgc agcatccctg gccgggacgc 2640 
acggtcttgt gtccttcctc gtgttcttct gctttgcgtg gtatctgaag ggtaggtggg 2700 
tgcccggagc ggtctacgcc ctctacggga tgtggcctct cctcctgctc ctgctggcgt 2760 
tgcctcagcg ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820 
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880 
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2940 
ccctcaacgt ccgggggggg cgcgatgccg tcatcttact catgtgtgta gtacacccga 3000 
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060 
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120 
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180 
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240 
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 3300 
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360 
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 3420 
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 3480 
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540 
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600 
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660 
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720 
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780 
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3840 
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900 
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3960 
caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 4020
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080 
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140 
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 4200 
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260 
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4320 
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 4380 
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440 
aggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500 
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560 
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4620 
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680 
gctttaccgg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 4740 
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4800 
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4860 
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4920 
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4980 
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 5040 
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100 
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160 
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220 
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 5280 
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 5340 
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 5400 
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 5460 
aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 5520 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 5640 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 5820 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 5940 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 6300 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 6360 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 6480 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 6540 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
aggaggtatc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 6840 
agcccgaacc ggacgtagcc gtgttgacgt ccatgctcac tgatccctcc catataacag 6900
cagaggcggc cgggagaagg ttggcgagag ggtcaccccc ttctatggcc agctcctcgg 6960 
ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080 
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140 
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 7200 
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260 
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7320 
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 7380 
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 7440 
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500 
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560 
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680 
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 7740 
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 7800 
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7860 
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7920 
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980 
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 8040 
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220 
tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280 
gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340 
gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400 
ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 
gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 
gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 
tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 
tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 
taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 
accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 
ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 
tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 
ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 
ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 
atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 
gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 
agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 
gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 
gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 
taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 
catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 
ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 
gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 
gcagatcatg t 9611 
<210> 6
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 6 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala
180 185 190

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr
195 200 205

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro
210 215 220 
Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile
225 230 235 240

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln
245 250 255

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys
260 265 270

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala
275 280 285

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys
290 295 300

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr
325 330 335

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His
340 345 350

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp
355 360 365

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg
370 375 380

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr
385 390 395 400

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr
405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn
435 440 445

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala
450 455 460

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn
465 470 475 480 
Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
610 615 620

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys
645 650 655

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser
660 665 670

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala
675 680 685

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln
690 695 700

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp
705 710 715 720

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
725 730 735 
Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu
740 745 750

Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly
755 760 765 

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
770 775 780 

Arg Trp Val Pro Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro Leu 
785 790 795 800 

Leu Leu Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala Leu Asp Thr
805 810 815

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr Ile Ser Trp Cys Met Trp
835 840 845

Trp Leu Gln Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His Val Trp
850 855 860

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val Ile Leu Leu
865 870 875 880

Met Cys Val Val His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu Leu
885 890 895

Leu Ala Ile Phe Gly Pro Leu Trp Ile Leu Gln Ala Ser Leu Leu Lys
900 905 910

Val Pro Tyr Phe Val Arg Val Gln Gly Leu Leu Arg Ile Cys Ala Leu
915 920 925

Ala Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Ile Ile Lys
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 



Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu Ile Thr Trp Gly Ala
980 985 990 
Asp Thr Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro Val Ser Ala
995 1000 1005

Arg Arg Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Met Val Ser
1010 1015 1020

Lys Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040

Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys
1045 1050 1055

Asn Gln Val Glu Gly Glu Val Gln Ile Val Ser Thr Ala Thr Gln Thr
1060 1065 1070

Phe Leu Ala Thr Cys Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly
1075 1080 1085

Ala Gly Thr Arg Thr Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met
1090 1095 1100

Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly
1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser
1140 1145 1150

Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln Val Ala His Leu
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 
Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser Pro Ile Thr
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ala Thr
1315 1320 1325

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly
1330 1335 1340

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr
1345 1350 1355 1360

Val Ser His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu
1365 1370 1375

Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly
1380 1385 1390

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala
1395 1400 1405

Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly
1410 1415 1420

Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ser
1425 1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile
1445 1450 1455

Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro
1460 1465 1470

Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala Val Ser Arg
1475 1480 1485

Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly Ile Tyr Arg
1490 1495 1500 
Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly
1555 1560 1565

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly
1570 1575 1580

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600

Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp Lys Cys Leu Ile
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ala Val Gln Asn Glu Val Thr Leu Thr His Pro Ile Thr Lys Tyr
1635 1640 1645

Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val Ile Val Gly Arg Ile Val Leu Ser Gly Lys Pro
1685 1690 1695

Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Gln Glu Phe Asp Glu Met
1700 1705 1710

Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln Gly Met Met Leu
1715 1720 1725

Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Ser
1730 1735 1740

Arg His Ala Glu Val Ile Thr Pro Ala Val Gln Thr Asn Trp Gln Lys
1745 1750 1755 1760 
Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile
1765 1770 1775

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala Gln Leu
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp Ile Leu
1845 1850 1855

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Ile
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 

Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala
1890 1895 1900

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr
1925 1930 1935

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile Leu
1940 1945 1950

Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu His Gln Trp Ile
1955 1960 1965

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile
1970 1975 1980

Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys
1985 1990 1995 2000

Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe Val Ser Cys Gln
2005 2010 2015 
Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly Ile Met His Thr Arg
2020 2025 2030

Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys Asn Gly Thr Met
2035 2040 2045

Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe
2050 2055 2060

Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

Ile Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp
2100 2105 2110

Asn Leu Lys Cys Pro Cys Gln Ile Pro Ser Pro Glu Phe Phe Thr Glu
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr
2165 2170 2175

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Gly Arg
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp
2210 2215 2220

Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu
2225 2230 2235 2240

Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Val Val Ile
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 
Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro Ile Asn
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp Arg
2465 2470 2475 2480

Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu Lys Glu Val Lys
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 
Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His Ile Asn
2530 2535 2540

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro Ile Asp Thr
2545 2550 2555 2560

Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys Gly
2565 2570 2575

Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val Arg
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln Arg
2610 2615 2620

Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly
2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 
2645 2650 2655 

Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln
2660 2665 2670

Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr
2725 2730 2735

Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly
2740 2745 2750

Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr
2755 2760 2765

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu
2770 2775 2780 
Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp Ala Arg Met Ile
2835 2840 2845

Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu
2850 2855 2860

Gln Ala Leu Asn Cys Glu Ile Tyr Gly Ala Cys Tyr Ser Ile Glu Pro
2865 2870 2875 2880

Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly Leu Ser Ala Phe
2885 2890 2895

Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg Val Ala Ala Cys
2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala Ile
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro Ile Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr
2965 2970 2975

Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val Ser His Ala Arg
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

Ile Tyr Leu Leu Pro Asn Arg
3010 3015 



<210> 7 
<211> 9611
<212> DNA 

<213> Hepatitis C virus 



<400> 7 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 540
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 720
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 840
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2040
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2340
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580
aggccgaagc agcactagag aagctggtca tcttgcacgc tgcgagcgca gctagctgca 2640
atggcttcct atattttgtc atctttttcg tggctgcttg gtacatcaag ggtcgggtag 2700
tccccttagc tacctattcc ctcactggcc tgtggtcctt tagcctactg ctcctagcat 2760
tgccccaaca ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2940
ccctcaacgt ccgggggggg cgcgatgccg tcatcttact catgtgtgta gtacacccga 3000
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 3300
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 3420
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 3480
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3840
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3960
caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 4020
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 4200
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4320
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 4380
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440
aggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4620
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680
gctttaccgg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 4740
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4800
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4860
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4920
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4980
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 5040
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 5280
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 5340
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 5400
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 5460



aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 5520 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 5640 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 5820 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 5940 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 6300 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 6360 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 6480 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 6540 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
aggaggtatc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 6840 
agcccgaacc ggacgtagcc gtgttgacgt ccatgctcac tgatccctcc catataacag 6900 
cagaggcggc cgggagaagg ttggcgagag ggtcaccccc ttctatggcc agctcctcgg 6960 
ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080 
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140 
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 7200 
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260 
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7320 
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 7380 
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 7440 
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500 
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560 
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680 
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 7740 
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 7800 
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7860 
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7920 
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980 
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 8040 
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220 
tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280 
gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340 
gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400 
ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 
gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 
gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 
tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 
tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 
taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 
accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 
ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 
tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 
ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 
ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 
atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 
gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 
agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 
gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 
gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 
taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 
catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 
ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 
gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 
gcagatcatg t 9611 



<210> 8 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 8 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 
50 55 60 

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln 
245 250 255 

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys 
290 295 300 

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp 
705 710 715 720 

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val Ile Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val Ile Phe Phe Val Ala Ala Trp Tyr Ile Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr Ile Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gln Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His Val Trp 
850 855 860 

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val Ile Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu Leu 
885 890 895 

Leu Ala Ile Phe Gly Pro Leu Trp Ile Leu Gln Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gln Gly Leu Leu Arg Ile Cys Ala Leu 
915 920 925 

Ala Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Ile Ile Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu Ile Thr Trp Gly Ala 
980 985 990 

Asp Thr Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Arg Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gln Val Glu Gly Glu Val Gln Ile Val Ser Thr Ala Thr Gln Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly 
1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile 
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln Val Ala His Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser Pro Ile Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 
1365 1370 1375 

Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly 
1380 1385 1390 

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ser 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile 
1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly Ile Tyr Arg 
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly 
1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp Lys Cys Leu Ile 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ala Val Gln Asn Glu Val Thr Leu Thr His Pro Ile Thr Lys Tyr 
1635 1640 1645 

Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val Ile Val Gly Arg Ile Val Leu Ser Gly Lys Pro 
1685 1690 1695 

Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Gln Glu Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln Gly Met Met Leu 
1715 1720 1725 

Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Ser 
1730 1735 1740 

Arg His Ala Glu Val Ile Thr Pro Ala Val Gln Thr Asn Trp Gln Lys 
1745 1750 1755 1760 

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile 
1765 1770 1775 

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala Gln Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp Ile Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Ile 
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 

Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met 
1905 1910 1915 1920 

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu His Gln Trp Ile 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile 
1970 1975 1980 

Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe Val Ser Cys Gln 
2005 2010 2015 

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly Ile Met His Thr Arg 
2020 2025 2030 

Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

Ile Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gln Ile Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Gly Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 
2210 2215 2220 

Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu 
2225 2230 2235 2240 

Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Val Val Ile 
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 

Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn 
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro Ile Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His Ile Asn 
2530 2535 2540 

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro Ile Asp Thr 
2545 2550 2555 2560 

Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln Arg 
2610 2615 2620 

Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly 
2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 
2645 2650 2655 

Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln 
2660 2665 2670 

Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu 
2770 2775 2780 

Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp Ala Arg Met Ile 
2835 2840 2845 

Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu 
2850 2855 2860 

Gln Ala Leu Asn Cys Glu Ile Tyr Gly Ala Cys Tyr Ser Ile Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg Val Ala Ala Cys 
2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala Ile 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro Ile Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

Ile Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 9 
<211> 9611 
<212> DNA 

<213> Hepatitis C virus 
<400> 9 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcaca aatcctaaac 360 
ctcaaagaaa aaccaaaaga aacaccaacc gtcgcccaca agacgttaag tttccgggcg 420 
gcggccagat cgttggcgga gtatacttgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgacaag gaagacttcg gagcggtccc agccacgtgg aaggcgccag cccatcccta 540 
aagatcggcg ctccactggc aaatcctggg gaaaaccagg atacccctgg cccctatacg 600 
ggaatgaggg actcggctgg gcaggatggc tcctgtcccc ccgaggttcc cgtccctctt 660 
ggggccccaa tgacccccgg cataggtcgc gcaacgtggg taaggtcatc gataccctaa 720 
cgtgcggctt tgccgacctc atggggtaca tccctgtcgt gggcgccccg ctcggcggcg 780 
tcgccagagc tctcgcgcat ggcgtgagag tcctggagga cggggttaat tttgcaacag 840 
ggaacttacc cggttgctcc ttttctatct tcttgctggc cctgctgtcc tgcatcacca 900 
ccccggtctc cgctgccgaa gtgaagaaca tcagtaccgg ctacatggtg actaacgact 960 
gcaccaatga cagcattacc tggcagctcc aggctgctgt cctccacgtc cccgggtgcg 1020 
tcccgtgcga gaaagtgggg aatgcatctc agtgctggat accggtctca ccgaatgtgg 1080 
ccgtgcagcg gcccggcgcc ctcacgcagg gcttgcggac gcacatcgac atggttgtga 1140 
tgtccgccac gctctgctct gccctctacg tgggggacct ctgcggtggg gtgatgctcg 1200 
cagcccaaat gttcattgtc tcgccgcagc accactggtt tgtccaagac tgcaattgct 1260 
ccatctaccc tggtaccatc actggacacc gcatggcatg ggacatgatg atgaactggt 1320 
cgcccacggc taccatgatc ttggcgtacg cgatgcgtgt ccccgaggtc attatagaca 1380 
tcattagcgg ggctcattgg ggcgtcatgt tcggcttggc ctacttctct atgcagggag 1440 
cgtgggcgaa agtcgttgtc atccttctgt tggccgccgg ggtggacgcg cgcacccata 1500 
ctgttggggg ttctgccgcg cagaccaccg ggcgcctcac cagcttattt gacatgggcc 1560 
ccaggcagaa aatccagctc gttaacacca atggcagctg gcacatcaac cgcaccgccc 1620 
tgaactgcaa tgactccttg cacaccggct ttatcgcgtc tctgttctac acccacagct 1680 
tcaactcgtc aggatgtccc gaacgcatgt ccgcctgccg cagtatcgag gccttccggg 1740 
tgggatgggg cgccttgcaa tatgaggata atgtcaccaa tccagaggat atgagaccct 1800 
attgctggca ctacccacca aggcagtgtg gcgtggtctc cgcgaagact gtgtgtggcc 1860 
cagtgtactg tttcaccccc agcccagtgg tagtgggcac gaccgacagg cttggagcgc 1920 
ccacttacac gtggggggag aatgagacag atgtcttcct attgaacagc actcgaccac 1980 
cgctggggtc atggttcggc tgcacgtgga tgaactcttc tggctacacc aagacttgcg 2040 
gcgcaccacc ctgccgtact agagctgact tcaacgccag cacggacctg ttgtgcccca 2100 
cggactgttt taggaagcat cctgatacca cttacctcaa atgcggctct gggccctggc 2160 
tcacgccaag gtgcctgatc gactacccct acaggctctg gcattacccc tgcacagtta 2220 
actataccat cttcaaaata aggatgtatg tgggaggggt tgagcacagg ctcacggctg 2280 
catgcaattt cactcgtggg gatcgttgca acttggagga cagagacaga agtcaactgt 2340 
ctcctttgtt gcactccacc acggaatggg ccattttacc ttgctcttac tcggacctgc 2400 
ccgccttgtc gactggtctt ctccacctcc accaaaacat cgtggacgta caattcatgt 2460 
atggcctatc acctgccctc acaaaataca tcgtccgatg ggagtgggta atactcttat 2520 
tcctgctctt agcggacgcc agggtttgcg cctgcttatg gatgctcatc ttgttgggcc 2580 
aggccgaagc agctttggag aacctcgtaa tactcaatgc agcatccctg gccgggacgc 2640 
acggtcttgt gtccttcctc gtgttcttct gctttgcgtg gtatctgaag ggtaggtggg 2700 
tgcccggagc ggtctacgcc ctctacggga tgtggcctct cctcctgctc ctgctggcgt 2760 
tgcctcagcg ggcatatgca ctggacacgg aggtggccgc gtcgtgtggc ggcgttgttc 2820 
ttgtcgggtt aatggcgctg actctgtcgc catattacaa gcgctatatc agctggtgca 2880 
tgtggtggct tcagtatttt ctgaccagag tagaagcgca actgcacgtg tgggttcccc 2940 
ccctcaacgt ccgggggggg cgcgatgccg tcatcttact catgtgtgta gtacacccga 3000 
ccctggtatt tgacatcacc aaactactcc tggccatctt cggacccctt tggattcttc 3060 
aagccagttt gcttaaagtc ccctacttcg tgcgcgttca aggccttctc cggatctgcg 3120 
cgctagcgcg gaagatagcc ggaggtcatt acgtgcaaat ggccatcatc aagttagggg 3180 
cgcttactgg cacctatgtg tataaccatc tcacccctct tcgagactgg gcgcacaacg 3240 
gcctgcgaga tctggccgtg gctgtggaac cagtcgtctt ctcccgaatg gagaccaagc 3300 
tcatcacgtg gggggcagat accgccgcgt gcggtgacat catcaacggc ttgcccgtct 3360 
ctgcccgtag gggccaggag atactgcttg ggccagccga cggaatggtc tccaaggggt 3420 
ggaggttgct ggcgcccatc acggcgtacg cccagcagac gagaggcctc ctagggtgta 3480 
taatcaccag cctgactggc cgggacaaaa accaagtgga gggtgaggtc cagatcgtgt 3540 
caactgctac ccaaaccttc ctggcaacgt gcatcaatgg ggtatgctgg actgtctacc 3600 
acggggccgg aacgaggacc atcgcatcac ccaagggtcc tgtcatccag atgtatacca 3660 
atgtggacca agaccttgtg ggctggcccg ctcctcaagg ttcccgctca ttgacaccct 3720 
gtacctgcgg ctcctcggac ctttacctgg tcacgaggca cgccgatgtc attcccgtgc 3780 
gccggcgagg tgatagcagg ggtagcctgc tttcgccccg gcccatttcc tacttgaaag 3840 
gctcctcggg gggtccgctg ttgtgccccg cgggacacgc cgtgggccta ttcagggccg 3900 
cggtgtgcac ccgtggagtg gctaaagcgg tggactttat ccctgtggag aacctaggga 3960 
caaccatgag atccccggtg ttcacggaca actcctctcc accagcagtg ccccagagct 4020 
tccaggtggc ccacctgcat gctcccaccg gcagcggtaa gagcaccaag gtcccggctg 4080 
cgtacgcagc ccagggctac aaggtgttgg tgctcaaccc ctctgttgct gcaacgctgg 4140 
gctttggtgc ttacatgtcc aaggcccatg gggttgatcc taatatcagg accggggtga 4200 
gaacaattac cactggcagc cccatcacgt actccaccta cggcaagttc cttgccgacg 4260 
gcgggtgctc aggaggtgct tatgacataa taatttgtga cgagtgccac tccacggatg 4320 
ccacatccat cttgggcatc ggcactgtcc ttgaccaagc agagactgcg ggggcgagac 4380 
tggttgtgct cgccactgct acccctccgg gctccgtcac tgtgtcccat cctaacatcg 4440 
aggaggttgc tctgtccacc accggagaga tcccctttta cggcaaggct atccccctcg 4500 
aggtgatcaa ggggggaaga catctcatct tctgccactc aaagaagaag tgcgacgagc 4560 
tcgccgcgaa gctggtcgca ttgggcatca atgccgtggc ctactaccgc ggtcttgacg 4620 
tgtctgtcat cccgaccagc ggcgatgttg tcgtcgtgtc gaccgatgct ctcatgactg 4680 
gctttaccgg cgacttcgac tctgtgatag actgcaacac gtgtgtcact cagacagtcg 4740 
atttcagcct tgaccctacc tttaccattg agacaaccac gctcccccag gatgctgtct 4800 
ccaggactca acgccggggc aggactggca gggggaagcc aggcatctat agatttgtgg 4860 
caccggggga gcgcccctcc ggcatgttcg actcgtccgt cctctgtgag tgctatgacg 4920 
cgggctgtgc ttggtatgag ctcacgcccg ccgagactac agttaggcta cgagcgtaca 4980 
tgaacacccc ggggcttccc gtgtgccagg accatcttga attttgggag ggcgtcttta 5040 
cgggcctcac tcatatagat gcccactttt tatcccagac aaagcagagt ggggagaact 5100 
ttccttacct ggtagcgtac caagccaccg tgtgcgctag ggctcaagcc cctcccccat 5160 
cgtgggacca gatgtggaag tgtttgatcc gccttaaacc caccctccat gggccaacac 5220 
ccctgctata cagactgggc gctgttcaga atgaagtcac cctgacgcac ccaatcacca 5280 
aatacatcat gacatgcatg tcggccgacc tggaggtcgt cacgagcacc tgggtgctcg 5340 
ttggcggcgt cctggctgct ctggccgcgt attgcctgtc aacaggctgc gtggtcatag 5400 
tgggcaggat cgtcttgtcc gggaagccgg caattatacc tgacagggag gttctctacc 5460 
aggagttcga tgagatggaa gagtgctctc agcacttacc gtacatcgag caagggatga 5520 
tgctcgctga gcagttcaag cagaaggccc tcggcctcct gcagaccgcg tcccgccatg 5580 
cagaggttat cacccctgct gtccagacca actggcagaa actcgaggtc ttttgggcga 5640 
agcacatgtg gaatttcatc agtgggatac aatacttggc gggcctgtca acgctgcctg 5700 
gtaaccccgc cattgcttca ttgatggctt ttacagctgc cgtcaccagc ccactaacca 5760 
ctggccaaac cctcctcttc aacatattgg gggggtgggt ggctgcccag ctcgccgccc 5820 
ccggtgccgc tactgccttt gtgggtgctg gcctagctgg cgccgccatc ggcagcgttg 5880 
gactggggaa ggtcctcgtg gacattcttg cagggtatgg cgcgggcgtg gcgggagctc 5940 
ttgtagcatt caagatcatg agcggtgagg tcccctccac ggaggacctg gtcaatctgc 6000 
tgcccgccat cctctcgcct ggagcccttg tagtcggtgt ggtctgcgca gcaatactgc 6060 
gccggcacgt tggcccgggc gagggggcag tgcaatggat gaaccggcta atagccttcg 6120 
cctcccgggg gaaccatgtt tcccccacgc actacgtgcc ggagagcgat gcagccgccc 6180 
gcgtcactgc catactcagc agcctcactg taacccagct cctgaggcga ctgcatcagt 6240 
ggataagctc ggagtgtacc actccatgct ccggttcctg gctaagggac atctgggact 6300 
ggatatgcga ggtgctgagc gactttaaga cctggctgaa agccaagctc atgccacaac 6360 
tgcctgggat tccctttgtg tcctgccagc gcgggtatag gggggtctgg cgaggagacg 6420 
gcattatgca cactcgctgc cactgtggag ctgagatcac tggacatgtc aaaaacggga 6480 
cgatgaggat cgtcggtcct aggacctgca ggaacatgtg gagtgggacg ttccccatta 6540 
acgcctacac cacgggcccc tgtactcccc ttcctgcgcc gaactataag ttcgcgctgt 6600 
ggagggtgtc tgcagaggaa tacgtggaga taaggcgggt gggggacttc cactacgtat 6660 
cgggtatgac tactgacaat cttaaatgcc cgtgccagat cccatcgccc gaatttttca 6720 
cagaattgga cggggtgcgc ctacacaggt ttgcgccccc ttgcaagccc ttgctgcggg 6780 
aggaggtatc attcagagta ggactccacg agtacccggt ggggtcgcaa ttaccttgcg 6840 
agcccgaacc ggacgtagcc gtgttgacgt ccatgctcac tgatccctcc catataacag 6900 
cagaggcggc cgggagaagg ttggcgagag ggtcaccccc ttctatggcc agctcctcgg 6960 
ctagccagct gtccgctcca tctctcaagg caacttgcac cgccaaccat gactcccctg 7020 
acgccgagct catagaggct aacctcctgt ggaggcagga gatgggcggc aacatcacca 7080 
gggttgagtc agagaacaaa gtggtgattc tggactcctt cgatccgctt gtggcagagg 7140 
aggatgagcg ggaggtctcc gtacctgcag aaattctgcg gaagtctcgg agattcgccc 7200 
gggccctgcc cgtctgggcg cggccggact acaacccccc gctagtagag acgtggaaaa 7260 
agcctgacta cgaaccacct gtggtccatg gctgcccgct accacctcca cggtcccctc 7320 
ctgtgcctcc gcctcggaaa aagcgtacgg tggtcctcac cgaatcaacc ctatctactg 7380 
ccttggccga gcttgccacc aaaagttttg gcagctcctc aacttccggc attacgggcg 7440 
acaatacgac aacatcctct gagcccgccc cttctggctg cccccccgac tccgacgttg 7500 
agtcctattc ttccatgccc cccctggagg gggagcctgg ggatccggat ctcagcgacg 7560 
ggtcatggtc gacggtcagt agtggggccg acacggaaga tgtcgtgtgc tgctcaatgt 7620 
cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc ggaagaacaa aaactgccca 7680 
tcaacgcact gagcaactcg ttgctacgcc atcacaatct ggtgtattcc accacttcac 7740 
gcagtgcttg ccaaaggcag aagaaagtca catttgacag actgcaagtt ctggacagcc 7800 
attaccagga cgtgctcaag gaggtcaaag cagcggcgtc aaaagtgaag gctaacttgc 7860 
tatccgtaga ggaagcttgc agcctgacgc ccccacattc agccaaatcc aagtttggct 7920 
atggggcaaa agacgtccgt tgccatgcca gaaaggccgt agcccacatc aactccgtgt 7980 
ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 8040 
aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 
ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 
ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 8220 
tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280 
gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340 
gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400 
ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 
gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 
gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 
tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 
tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 
taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 
accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 
ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 
tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 
ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 
ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 
atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 
gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 
agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 
gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 
gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 
taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 
catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 
ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 
gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 
gcagatcatg t 9611 



<210> 10 
<211> 3015 
<212> PRT 

<213> Hepatitis C virus 
<400> 10 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 
50 55 60 

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln 
245 250 255 

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys 
290 295 300 

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp 
705 710 715 720 

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu 
740 745 750 

Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly 
755 760 765 

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
770 775 780 

Arg Trp Val Pro Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro Leu 
785 790 795 800 

Leu Leu Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala Leu Asp Thr 
805 810 815 

Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met Ala 
820 825 830 

Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr Ile Ser Trp Cys Met Trp 
835 840 845 

Trp Leu Gln Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His Val Trp 
850 855 860 

Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val Ile Leu Leu 
865 870 875 880 

Met Cys Val Val His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu Leu 
885 890 895 

Leu Ala Ile Phe Gly Pro Leu Trp Ile Leu Gln Ala Ser Leu Leu Lys 
900 905 910 

Val Pro Tyr Phe Val Arg Val Gln Gly Leu Leu Arg Ile Cys Ala Leu 
915 920 925 

Ala Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Ile Ile Lys 
930 935 940 

Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu 
945 950 955 960 

Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu Ile Thr Trp Gly Ala 
980 985 990 

Asp Thr Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Arg Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Met Val Ser 
1010 1015 1020 

Lys Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys 
1045 1050 1055 

Asn Gln Val Glu Gly Glu Val Gln Ile Val Ser Thr Ala Thr Gln Thr 
1060 1065 1070 

Phe Leu Ala Thr Cys Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Thr Arg Thr Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met 
1090 1095 1100 

Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly 
1105 1110 1115 1120 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser 
1140 1145 1150 

Arg Gly Ser Leu Leu Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile 
1185 1190 1195 1200 

Pro Val Glu Asn Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp 
1205 1210 1215 

Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln Val Ala His Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 
1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser Pro Ile Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr Asp Ala Thr 
1315 1320 1325 

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly 
1330 1335 1340 

Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Val Ser His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu 
1365 1370 1375 

Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly 
1380 1385 1390 

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ser 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile 
1445 1450 1455 

Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala Val Ser Arg 
1475 1480 1485 

Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly Ile Tyr Arg 
1490 1495 1500 

Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly 
1570 1575 1580 

Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp Lys Cys Leu Ile 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ala Val Gln Asn Glu Val Thr Leu Thr His Pro Ile Thr Lys Tyr 
1635 1640 1645 

Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser 
1665 1670 1675 1680 

Thr Gly Cys Val Val Ile Val Gly Arg Ile Val Leu Ser Gly Lys Pro 
1685 1690 1695 

Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Gln Glu Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln Gly Met Met Leu 
1715 1720 1725 

Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Ser 
1730 1735 1740 

Arg His Ala Glu Val Ile Thr Pro Ala Val Gln Thr Asn Trp Gln Lys 
1745 1750 1755 1760 

Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile 
1765 1770 1775 

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala 
1780 1785 1790 

Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly 
1795 1800 1805 

Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala Gln Leu 
1810 1815 1820 

Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly 
1825 1830 1835 1840 

Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp Ile Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Ile 
1860 1865 1870 

Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro 
1875 1880 1885 

Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala 
1890 1895 1900 

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met 
1905 1910 1915 1920 

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr 
1925 1930 1935 

His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile Leu 
1940 1945 1950 

Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu His Gln Trp Ile 
1955 1960 1965 

Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile 
1970 1975 1980 

Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys 
1985 1990 1995 2000 

Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe Val Ser Cys Gln 
2005 2010 2015 

Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly Ile Met His Thr Arg 
2020 2025 2030 

Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys Asn Gly Thr Met 
2035 2040 2045 

Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe 
2050 2055 2060 

Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro 
2065 2070 2075 2080 

Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu 
2085 2090 2095 

Ile Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Cys Pro Cys Gln Ile Pro Ser Pro Glu Phe Phe Thr Glu 
2115 2120 2125 

Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu 
2130 2135 2140 

Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Gly Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp 
2210 2215 2220 

Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu 
2225 2230 2235 2240 

Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn Lys Val Val Ile 
2245 2250 2255 

Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val 
2260 2265 2270 

Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala 
2275 2280 2285 

Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr 
2290 2295 2300 

Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu 
2305 2310 2315 2320 

Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
2325 2330 2335 

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala 
2340 2345 2350 

Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn 
2355 2360 2365 

Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser 
2370 2375 2380 

Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala 
2405 2410 2415 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly 
2420 2425 2430 

Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro Ile Asn 
2435 2440 2445 

Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
2450 2455 2460 

Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp Arg 
2465 2470 2475 2480 

Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu Lys Glu Val Lys 
2485 2490 2495 

Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
2500 2505 2510 

Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly 
2515 2520 2525 

Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His Ile Asn 
2530 2535 2540 

Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro Ile Asp Thr 
2545 2550 2555 2560 

Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys Gly 
2565 2570 2575 

Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val Arg 
2580 2585 2590 

Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu 
2595 2600 2605 

Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln Arg 
2610 2615 2620 

Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly 
2625 2630 2635 2640 

Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp 
2645 2650 2655 

Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln 
2660 2665 2670 

Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
2675 2680 2685 

Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg 
2690 2695 2700 

Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr 
2705 2710 2715 2720 

Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr 
2725 2730 2735 

Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly 
2740 2745 2750 

Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr 
2755 2760 2765 

Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu 
2770 2775 2780 

Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly 
2785 2790 2795 2800 

Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu 
2805 2810 2815 

Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp 
2820 2825 2830 

Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp Ala Arg Met Ile 
2835 2840 2845 

Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu 
2850 2855 2860 

Gln Ala Leu Asn Cys Glu Ile Tyr Gly Ala Cys Tyr Ser Ile Glu Pro 
2865 2870 2875 2880 

Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly Leu Ser Ala Phe 
2885 2890 2895 

Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg Val Ala Ala Cys 
2900 2905 2910 

Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala 
2915 2920 2925 

Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala Ile 
2930 2935 2940 

Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu 
2945 2950 2955 2960 

Thr Pro Ile Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr 
2965 2970 2975 

Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val Ser His Ala Arg 
2980 2985 2990 

Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly 
2995 3000 3005 

Ile Tyr Leu Leu Pro Asn Arg 
3010 3015 



<210> 11 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 11 

actggacacg gaggtggccg cgtc 24 



<210> 12 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 12 

ttgttcttgt cgggttaatg gcgc 24 



<210> 13 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 13 

gggtgtacta cacacatgag taag 24 



<210> 14 
<211> 22 
<212> DNA 

<213> Hepatitis C virus 
<400> 14 

aagcgcccct aacttgatga tg 22 



<210> 15 
<211> 40 
<212> DNA 

<213> Hepatitis C virus 
<400> 15 

cgtcatcgat acctcagcgg gcatatgcac tggacacgga 40 



<210> 16 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 16 

gtccagtgca tatgcccgct gagg 24 



<210> 17 
<211> 32 
<212> DNA 

<213> Hepatitis C virus 
<400> 17 

catgcaccag ctgatatagc gcttgtaata tg 32 



<210> 18 
<211> 30 
<212> DNA 

<213> Hepatitis C virus 
<400> 18 

tccgtagagg aagcttgcag cctgacgccc 30 



<210> 19 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 
<400> 19 

cagaggaggc agggctgcta tatgtggcaa gtac 34 



<210> 20 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 
<400> 20 

gtacttgcca catatagcag ccctgcctcc tctg 34 



<210> 21 
<211> 43 
<212> DNA 

<213> Hepatitis C virus 
<400> 21 

cgtctctaga caggaaatgg cttaagaggc cggagtgttt acc 43 



<210> 22 
<211> 65 
<212> DNA 

<213> Hepatitis C virus 
<400> 22 

ttatggatgc tcatcttgtt gggccaggcc gaagcagctt tggagaacct cgtaatactc 60 
aatgc 65 



<210> 23 
<211> 32 
<212> DNA 

<213> Hepatitis C virus 
<400> 23 

aggatttgtg ctcatggtgc acggtctacg ag 32 



<210> 24 
<211> 50 
<212> DNA 

<213> Hepatitis C virus 
<400> 24 

ttttttttgc ggccgctaat acgactcact atagacccgc ccctaatagg 50 



<210> 25 
<211> 31 
<212> DNA 

<213> Hepatitis C virus 
<400> 25 

ccgtgcacca tgagcacaaa tcctaaacct c 31 



<210> 26 
<211> 26 
<212> DNA 

<213> Hepatitis C virus 
<400> 26 

ggatgtaccc catgaggtcg gcaaag 26 



<210> 27 
<211> 30 
<212> DNA 

<213> Hepatitis C virus 
<400> 27 

gtttgcgcct gcttatggat gctcatcttg 30 



<210> 28 
<211> 26 
<212> DNA 

<213> Hepatitis C virus 
<400> 28 

gcgtcataag catatgcctg ttgggg 26 

<210> 29 
<211> 23 
<212> DNA 

<213> Hepatitis C virus 
<400> 29 

ccctcagcac tggagtacat ctg 23 



<210> 30 
<211> 39 
<212> DNA 

<213> Hepatitis C virus 
<400> 30 

cgtcatgcat acccctaggg cggctctcat tgaagaggg 39 



<210> 31 
<211> 30 
<212> DNA 

<213> Hepatitis C virus 
<400> 31 

cgtcccctct tcaatgagag ccgctctaga 30 



<210> 32 
<211> 28 
<212> DNA 

<213> Hepatitis C virus 
<400> 32 

gcggtgaaga ccaagctcaa actcactc 28 



<210> 33 
<211> 41 
<212> DNA 

<213> Hepatitis C virus 
<400> 33 

aatctagaag gcgcgcttcc ggcaatggag tgagtttgag 



<210> 34 
<211> 38 
<212> DNA 

<213> Hepatitis C virus 
<400> 34 

cgtctctaga ggataaatcc aggaggcgcg cttccggc 38 



<210> 35 
<211> 27 
<212> DNA 

<213> Hepatitis C virus 
<400> 35 

tactttttgt aggggtaggc cttttcc 27 



<210> 36 
<211> 34 
<212> DNA 

<213> Hepatitis C virus 
<400> 36 

cgtctctaga gtgtagctaa tgtgtgccgc tcta 34 



<210> 37 
<211> 24 
<212> DNA 

<213> Hepatitis C virus 
<400> 37 

ctatggagtg tagctaatgt gtgc 24 



<210> 38 
<211> 66 
<212> DNA 

<213> Hepatitis C virus 
<400> 38 

cgtctctaga catgatctgc agagagacca gttacggcac tctctgcagt catgcggctc 60 
acggac 66 



<210> 39 
<211> 41 
<212> DNA 

<213> Hepatitis C virus 
<400> 39 

ctttcacagc tagccgtgac tagggctaag atggagccac c 41 